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Abstract 

Wc provide a Polynomial Time Approximation Scheme for the m,ulti- dimensional unit-demand 
pricing problem, when the buyer's values are independent (but not necessarily identically dis- 
tributed.) For all e > 0, we obtain a (1 + e)-factor approximation to the optimal revenue in 
time polynomial, when the values are sampled from Monotone Hazard Rate (MHR) distributions, 
quasi-polynomial, when sampled from regular distributions, and polynomial in n^°^^(^°Br) ^ when 
sampled from general distributions supported on a set [umin^rumin]- We also provide an additive 
PTAS for all boimded distributions. 

Our algorithms are based on novel extreme value theorems for MHR and regular distributions, 
and apply probabilistic techniques to understand the statistical properties of revenue distributions, 
as well as to reduce the size of the search space of the algorithm. As a byproduct of our techniques, 
we establish structural properties of optimal solutions. We show that, for all e > 0, (?(l/e) distinct 
prices suffice to obtain a (l-l-e)-factor approximation to the optimal revenue for MHR distributions, 
where (7(l/e) is a quasi-linear function of 1/e that docs not depend on the number of items. 
Similarly, for all e > and n > 0, g{l/e ■ logn) distinct prices suffice for regular distributions, 
where n is the number of items and g{-) is a polynomial function. Finally, in the i.i.d. MHR case, 
wc show that, as long as the number of items is a suSiciently large function of 1/e, a single price 
suffices to achieve a (1 -|- e)-factor approximation. 



1 Introduction 



Here is a natural pricing problem: A seller has n items to sell to a buyer who is interested in buying 
a single item. The seller wants to maximize her profit from the sale, and wants to leverage stochastic 
knowledge she has about the buyer to achieve this goal. In particular, we assume that the seller has 
access to a distribution T from which the values (fi, . . . of the buyer for the items are drawn. 
Given this information, she needs to compute prices pi, . . . ,p„ for the items to maximize her revenue, 
assuming that the buyer is quasi-linear — i.e. will buy the item i maximizing Vi — Pi, as long as this 
difference is positive. Hence, the seller's expected payoff from a price vector P = (pi, . . . ,pn) is 



where we assume that the arg max breaks ties in a consistent way, if there are multiple maximizers. 
A more sophisticated seller could try to improve her payoff by pricing lotteries over items, i.e. price 
randomized allocations (see [3],) albeit this may be less natural than item pricing. 

While the problem looks simple, it exhibits a rich behavior depending on the nature of J-. For 
example, if J- assigns the same value to all the items with probability 1, i.e. when the buyer always 
values all items equally, the problem degenerates to — what Economists call — a single- dimensional 
setting. In this setting, it is obvious that lotteries do not improve the revenue and that an optimal 
price vector should assign the same price to all items. This observation is a special case of a more 
general, celebrated result of Myerson [H] on optimal mechanism design (i.e. the multi-buyer version 
of the above problem, and generalizations thereof.) Myerson's result provides a closed- form solution 
to this generalized problem in a single sweep that covers many settings, but only works under the 
same limiting assumption that every buyer is single-dimensional, i.e. receives the same value from all 
the items (in general, the same value from all outcomes where she is provided service.) 

Following Myerson's work, a large body of research in both Economics and Engineering has been 
devoted to extending this result to the multi- dimensional setting, i.e. when the buyers' values come 
from general distributions. And while there has been sporadic progress (see survey [13j and its 
references,) it appears that we are far from an optimal multi-dimensional mechanism, generalizing 
Myerson's result. In particular, there is no optimal solution known to even the single-buyer problem 
presented above. Even the ostensibly easier version of that problem, where the values of the buyer 
for the n items are independent and supported on a set of cardinality 2 appears difficult. [] 

Motivated by the importance of the problem to Economics, and intrigued by its simplicity and 
apparent hardness, we devote this paper to the multi-dimensional pricing problem. Our main contri- 
bution is to develop the first near-optimal algorithms for this problem, when the buyer's values are 
independent (but not necessarily identically distributed) random variables. 

Previous work on this problem by Chawla et al. [5l |6] provides factor 2 approximation to the 
revenue achieved by the optimal price vector. The elegant observation enabling this result is to 
consider the following mental experiment: suppose that the unit-demand buyer is split into n "copies" 
ti, . . . ,tn- Copy ti is only interested in item i and her value for that item is drawn from the distribution 
J^i (where J-'i is the marginal of J- on item i), independently from the values of the other copies. On 
the other hand, the seller has the same feasibility constraints as before: only one item can be sold 
in this auction. It is intuitively obvious and can be formally established that the seller in the latter 
scenario is in better shape: there is more competition in the market and this can be exploited to 
extract more revenue. So the revenue of the seller in the original scenario can be upper bounded by 
the revenue in the hypothetical scenario. Moreover, the latter is a single-parameter setting; hence, 

'^Incidentally, the problem is trickier than it originally seems, and various intuitive properties that one would expect 
from an optimal solution fail to hold. See Appendix lAl for an example. 
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we understand exactly how its optimal revenue behaves by Myerson's result. So we can go back to 
our original setting and design a mechanism whose revenue comes close to Myerson's revenue in the 
hypothetical scenario. Using this approach [6] obtains a 2-approximation to the optimal revenue. 
Moreover, if the distributions {J^i}i are regular (this is a commonly studied class of distributions in 
Economics,) the corresponding price vector can be computed efficiently. 

Nevertheless, there is an inherent loss in the approach outlined above, as the revenue obtained 
by the sought-after mechanism will eventually be compared to a revenue that is not the optimal 
achievable revenue in the real setting, but the optimal revenue in a hypothetical setting; and as far as 
we know this could be up to a factor of 2 larger than the real one. So it could be that this approach 
is inherently limited to constant factor approximations. We are interested instead in efficient pricing 
mechanisms that achieve a (1 — e)-fraction of the optimal revenue, for arbitrarily small e. We show 

Theorem 1 (PTAS for MHR Distributions). For all e > 0, there is a Polynomial Time Approx- 
imation Scheme^ for computing a price vector whose revenue is a (1 + e)-factor approximation to 
the optimal revenue, when the values of the buyer are independent and drawn from Monotone Hazard 
Rate distributions. (This is a commonly studied class of distributions in Economics — see Section^) 
For all e > 0, the algorithm runs in time nP"^^^^/'^) . 

Theorem 2 (Quasi-PTAS for Regular Distributions). For all e > 0, there is a Quasi-Polynomial 
Time Approximation Scheme^for computing a price vector whose revenue is a {1 -\- e) -factor approx- 
imation to the optimal revenue, when the values of the buyer are independent and drawn from regular 
distributions. (These contain MHR and are also commonly studied in Economics — see Section^) 
For all e > 0, the algorithm runs in time nP°^y('°s°'^/'^) . 

Theorem 3 (General Algorithm). For all e > 0, there is an algorithm for computing a price vec- 
tor whose revenue is a (1 + e)-factor approximation to the optimal revenue, whose running time is 
ri^po\y{-,\ogT) yjf^Q^ ^/jg ydiii^Qg of the buyer are independent and distributed in an interval [uminirumin\^ 

Theorem 4 (Additive PTAS-General Distributions). For all e > 0, there is a PTAS for computing 
a price vector whose revenue is within an additive e of the optimal revenue, when the values of the 
buyer are independent and distributed in [0, 1]. 

Structural Theorems. Our approach is different than that of \5, "6] in that we study directly the 
optimal revenue (as a random variable,) rather than only relating its expectation to a benchmark 
that may be off by a constant factor. Clearly, the optimal revenue is a function of the values (which 
are random) and the optimal price vector (which is unknown). Hence it may be hard to pin down its 
distribution exactly. Nevertheless, we manage to understand its statistical properties sufficiently to 
deduce the following interesting structural theorems. 

Theorem 5 (Structural 1: A Constant Number of Distinct Prices Suffice for MHR Distributions). 

There exists a (quasi-linear) function g{-) such that, for all e > and all n > 0, g{l/e) distinct prices 
suffice for o (1 + e)- approximation to the optimal revenue when the buyer's values for the n items are 
independent and MHR. These distinct prices can be computed efficiently from the value distributions. 

Theorem 6 (Structural 2: A Polylog Number of Distinct Prices Suffice for Regular Distributions). 

There exists a (polynomial) function g{-) such that, for alle> andn > 0, ^(l/e-logn) distinct prices 
suffice for a {1 + e)- approximation to the optimal revenue, when the buyer's values for the n items are 
independent and regular. These prices can be computed efficiently from the value distributions. 



^Recall that a Polynomial Time Approximation Scheme (PTAS) is a family of algorithms indexed by a 

parameter e > 0, such that for every fixed e > 0, Ae runs in time polynomial in the size of its input. 

^Recall that a Quasi Polynomial Time Approximation Scheme (Quasi-PTAS) is a family of algorithms indexed 
by a parameter e > 0, such that for every fixed e > 0, Ae runs in time quasi-polynomial in the size of its input. 

"^We point out that a straightforward application of the discretization proposed by Nisan (see O) or Hartline and 
Koltun [11 would only give a (^ logr)'^'"'-time algorithm. 



Theorem [5] shows that, when the values are MHR independent, then only the desired approximation e 
dictates the number of distinct prices that are necessary to achieve a (1 + e)-approximation to the 
optimal revenue, and the number of items n as well as the range of the distributions are irrelevant (!) 
Theorem [6] generalizes this to a mild dependence on n for regular distributions. Establishing these 
theorems is quite challenging, as it relies on a deep understanding of the properties of the tails of MHR 
and regular distributions. For this purpose, we develop novel extreme value theorems for these classes 
of distributions (Theorems 12 and 14 in Sections [s] and |4] respectively.) Our theorems bound the 



size of the tail of the maximum of n independent (but not necessarily identically distributed) random 
variables, which are MHR or regular respectively, and are instrumental in establishing the following 
truncation property: truncating all the values into a common interval of the form [a, poly(l/e)Q!] 
in the MHR case, and [a,poly(n, l/e)a] in the regular case, for some a that depends on the value 
distributions, only loses a fraction of e of the optimal revenue. This is quite remarkable, especially in 
the case that the value distributions are non-identical. Why should most of the contribution to the 
optimal revenue come from a restricted set as above, when each of the underlying value distributions 
may concentrate on different supports? We expect that our extreme value theorems will be useful in 
future work, and indeed they have already been used [8j . As a final remark, we would like to point out 
that extreme value theorems have been obtained in Statistics for large classes of distributions |9j , and 
indeed those theorems have been applied earlier in optimal mechanism design [2]. Nevertheless, known 
extreme value theorems are typically asymptotic, only hold for maxima of i.i.d. random variables, 
and are not known to hold for all MHR or regular distributions. 

Covers of Revenue Distributions. Our structural theorems enable us to significantly reduce the 
search space for an (approximately) optimal price vector. Nevertheless, our value distributions are 
not necessarily identically distributed, so the search space remains exponentially large even for the 
MHR case, where a constant (function of e only) number of distinct prices suffice by Theorem [5] Even 
if there are only 2 possible prices, how can we efficiently decide what price to give to each item if the 
items are not i.i.d? The natural approach would be to cluster the distributions into a small number 
of buckets, containing distributions with similar statistical properties, and proceed to treat all items 
in a bucket as essentially identical. However, the problem at hand is not sufficiently smooth for us 
to perform such bucketing and several intuitive bucketing approaches fail. We can obtain a delicate 



discretization of the support of the distributions into a small set (Lemma 48 ) , but cannot discretize 
the probabilities used by these distributions into coarse-enough accuracy, arriving at an impasse with 
discretization ideas. 

Our next conceptual idea is to shift the focus of attention from the space of input value distri- 
butions^ which is inherently exponential, to the space of all possible revenue distributions, which are 
scalar random variables. (As we mentioned earlier, the revenue from a given price vector can be 
viewed as a random variable that depends on the values.) There are still exponentially many possible 
revenue distributions (one for each price vector,) but we find a way to construct a sparse 5-covei of 
this space under the total variation distance between distributions. The cover is implicit, i.e. it has 
no succinct closed-form description. We argue instead that it can be produced by a dynamic program, 
which considers prefixes of the items and constructs sub-covers for the revenue distributions induced 
by these prefixes, pruning down the size of the cover before growing it again to include the next 
item. Once a cover of the revenue distributions is obtained in this way, we argue that there is only 
a (5-fraction of revenue lost by replacing the optimal revenue distribution with its proxy in the cover. 
The high-level structure of the argument is provided in Section |6j and the details are in Section [7j 
Finally, the proofs of our algorithmic results (Theorems [l| [2| [s] and [4]) are given in Section [jj 

Extensions and Related Work A natural conjecture is that, when the distributions are not 
widely different, a single price should suffice for extracting a (1 — e)-fraction of the optimal revenue; 
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that is, as long as there is a sufficient number of items for sale. We show such a result in the case 
that the buyer's values are i.i.d. according to a MHR distribution. See Appendix [K} 

Theorem 7 (Structural 3 (i.i.d.): A Single Price Suffices for MHR Distributions). There is a 
function g{-) such that, for any e > 0, if the number of items n > g^l/e) then a single price suffices 
for a (1 + e) -factor approximation to the optimal revenue, if the buyer's values are i.i.d. and MHR. 

Another interesting byproduct of our techniques is that any constant-factor approximation to the 
optimal pricing can be converted into a PTAS or a quasi-PTAS respectively in the case of MHR or 
regular value distributions. This result (whose proof is given in Appendix [j]) is a direct product of 
our extreme value theorems, which can be boot-strapped with a constant factor approximation to 
OPT. Having such approximation would obviate the need to use our generic algorithm, outlined in 
the proofs of Theorems [5] and [6} 

Theorem 8 (Constant Factor to Near-Optimal Approximation). // we have a constant-factor ap- 
proximation to the optimal revenue of an instance of the pricing problem where the values are either 
MHR or regular, we can use this to speed-up our algorithms of Theorems^ and^ 

Future and Related Work. In conclusion, this paper provides the ffist near-optimal efficient 
algorithms for interesting instances of the multi-dimensional mechanism design problem, for a unit- 
demand bidder whose values are independent (but not necessarily identically distributed.) Our results 
provide algorithmic, structural and probabilistic insights into the properties of the optimal determin- 
istic mechanism for the case of MHR, regular, and more general distributions. It would be interesting 
to extend our results (algorithmic and/or structural) to more general distributions, to mechanisms 
that price lotteries over items [IZlEj, to bundle-pricing [12] and to budgets [HdS]. We can certainly 
obtain such extensions, albeit when sizes of lotteries, bundles, etc. are a constant. We believe that our 
extreme value theorems, and our probabilistic view of the problem in terms of revenue distributions 
will be helpful in obtaining more general results. We also leave the complexity of the exact problem 
as an open question, and conjecture that it is A^P-hard, referring the reader to [1] for hardness results 
in the case of correlated distributions. 

Finally, it is important to solve the multi-bidder problem, extending Myerson's celebrated mecha- 
nism to the multi-dimensional setting, and the results of [Il[6] beyond constant factor approximations. 
In recent work, Daskalakis and Weinberg [8] have made progress in this front obtaining efficient mech- 
anisms for multi-bidder multi-item auctions. These results are neither subsumed, nor subsume the 
results in the present paper. Indeed, we are more general here in that we allow the buyer to have 
values for the items that are not necessarily i.i.d., an assumption needed in [H] if the number of items 
is large. On the other hand, we are less general in that (a) we solve the single-bidder problem and (b) 
are near-optimal with respect to all deterministic (i.e. item-pricing), but not necessarily randomized 
(lottery-pricing) mechanisms. Strikingly, the techniques of the present paper are essentially orthogo- 
nal to those of |8j. The approach of [8] uses randomness to symmetrize the solution space, coupling 
this symmetrization with Linear Programming formulations of the problem. Our paper takes instead 
a probabilistic approach, developing extreme value theorems to characterize the optimal solution, and 
designing covers of revenue distributions to obtain efficient algorithmic solutions. It is tempting to 
conjecture that our approach here, combined with that of [8J would lead to more general results. 
Indeed, our extreme value theorems found use in [8], but we expect that significant technical work is 
required to go forward. 

2 Preliminaries 

For a random variable X we denote by Fx{x) the cumulative distribution function of X, and by fxi^) 
its probability density function. We also let = sup{x\Fx{x) = 0} and u^^ax — inf{x|Fx(a;) = !}• 

4 



^g^^ may be +00, but we assume that u^^^ > 0, since the distributions we consider in this paper 



represent value distributions of items. Moreover, we often drop the subscript or superscript of X, if X 
is clear from context. A natural question is how distributions are provided as input to an algorithm 
(explicitly or with oracle access). We discuss this technical issue in Appendix [C] We also define 
precisely what it means for an algorithm to be "efficient" in each case. We continue with the precise 
definition of Monotone Hazard Rate (MHR) and Regular distributions, which are both commonly 
studied classes of distributions in Economics. 

Definition 9 (Monotone Hazard Rate Distribution). We say that a one- dimensional differentiahle 



distribution F has Monotone Hazard Rate, shortly MHR, if iL^pJ^^.) non- decreasing in [u. 

Definition 10 (Regular Distribution). A one- dimensional differentiahle distribution F is called reg- 
ular if X — ^ is non- decreasing in [umin,Umax]- 

It is worth noticing that all MHR distributions are also regular distributions, but there are regular 
distributions that are not MHR. The family of MHR distributions includes such familiar distributions 
as the Normal, Exponential, and Uniform distributions. The family of regular distributions contains a 
broader range of distributions, such as fat-tail distributions fx{x) ~ x^^^^"") for a > 1 (which are not 
MHR). In Appendix [P] and [E| we establish important properties of MHR and regular distributions. 



rmni "Umax 



These properties are instrumental in establishing our extreme value theorems (Theorems 12 and 14 
in the following sections). 

We conclude this section by defining two computational problems. For the value distributions 
that we consider, we can show that they are well-defined (i.e. have finite optimal solutions.) 

Price: Input: A collection of mutually independent random variables {vjjJL]^, and some e > 0. 
Output: A vector of prices (pi, . . . ,pn) such that the expected revenue TZp under this price vector, 
defined as in Eq. ([T]), is within a (1 -|- e)-factor of the optimal revenue achieved by any price vector. 

RestrictedPrice: Input: A collection of mutually independent random variables {uj}"^]^, a 
discrete set V C M^., and some e > 0. 

Output: A vector of prices (pi, . . . ,Pra) G such that the expected revenue IZp under this price 
vector is within a (1 -|- e)-factor of the optimal revenue achieved by any vector in "P". 

3 Extreme Values of MHR Distributions 

We reduce the problem of finding a near-optimal price vector for MHR distributions to finding a 
near-optimal price vector for value distributions supported on a common, balanced interval, where 
the imbalance of the interval is only a function of the desired approximation e > 0. More precisely. 

Theorem 11 (From MHR to Balanced Distributions). Let V = {vi}i^[n] be a collection of mutually 
independent (but not necessarily identically distributed) MHR random variables. Then there exists 
some P = /3(V) > such that for all e G (0,1/4), there is a reduction from Price(V, celog(^)) to 
PRiCE(V,e), where V := {vi}i is a collection of mutually independent random variables supported on 
the set [|/3, 2 log ^/3], and c is some absolute constant. 

Moreover, (3 is efficiently computable from the distributions of the Xi 's (whether we are given the 
distributions explicitly, or we have oracle access to them,) and for every e the running time of the 
reduction is polynomial in the size of the input and ^ . In particular, if we have oracle access to the 
distributions of the Vi 's, then the forward reduction produces oracles for the distributions of the Vi 's, 
which run in time polynomial in n, 1/e, the input to the oracle and the desired oracle precision. 

We discuss the essential elements of this reduction below. Most crucially, the reduction is en- 
abled by the following characterization of the extreme values of a collection of independent, but not 
necessarily identically distributed, MHR distributions. 
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Theorem 12 (Extreme Values of MHR distributions). Let Xi, . . . , Xn be a collection of independent 
(but not necessarily identically distributed) random variables whose distributions are MHR. Then there 
exists some anchoring point (3 such that Pr[maxj{Xj} > /3/2] > 1 — ^ and 



r+oo 

/ t ■ /„,ax.{x.}(i)di < 36/3elog 1/e, for all e e (0, 1/4). (2) 

J2/31ogl/e 



Moreover, (3 is efficiently computable from the distributions of the Xi's (whether we are given the 
distributions explicitly, or we have oracle access to them.) 



Theorem 
of E[maxi Xi 
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(whose proof is in Appendix F.l ) shows that, for ah e, at least a (1— 0(e log ^))-fraction 
is contributed to by values that are no larger than E[maxi Xi] ■ log j. Our result is quite 
surprising, especially for the case of non-identically distributed MHR random variables. Why should 
most of the contribution to E[maxjXj] come from values that are close (within a function of e only) 
to the expectation, when the underlying random variables Xi may concentrate on widely different 
supports? To obtain the theorem one needs to understand how the tails of the distributions of a 
collection of independent but not necessarily identically distributed MHR random variables contribute 
to the expectation of their maximum. Our proof technique is rather intricate, defining a tournament 
between the tails of the distributions. Each round of the tournament ranks the distributions according 
to the size of their tails, and eliminates the lightest half. The threshold /? is then obtained by some 
side-information that the algorithm records in every round. 



Given our understanding of the extreme values of MHR distributions, our reduction of Theorem 11 
from MHR to Balanced distributions proceeds in the following steps: 



We start with the computation of the threshold (3 specified by Theorem 12 This computation 



can be done efficiently, as stated in the statement of the theorem. Given that Pr[maxj{Xj} > 



/3/2] is bounded away from 0, (3 provides a lower bound to the optimal revenue. See Section [F. 2.1 



for the precise lower bound we obtain. Such lower bound is useful as it implies that, if our 
transformation loses revenue that is a small fraction of /3, this corresponds to a small fraction 
of optimal revenue lost. 

Next, using Q we show that, for all e > 0, if we restrict the prices to lie in the balanced interval 
[e- ^, 2 log(^) • we only lose a 0(e log 1/e) fraction of the optimal revenue; this step is detailed 
in Section |Fl221 

Finally, we show that we can efficiently transform the given MHR random variables {t'i}ie[n] 
into a new collection of random variables {'SjjiGH ^^^^ take values in [| • /3,21og(^) • (3] and 
satisfy the following: a near-optimal price vector for the setting where the buyer's values are 
distributed as {t'ijiGH efficiently transformed into a near-optimal price vector for the 

original setting, i.e. where the buyer's values are distributed as {vi}i^[n]. This step is detailed 
in Section If. 2. 3[ 



4 Extreme Values of Regular Distributions 

Our goal is to reduce the problem of finding a near-optimal pricing for a collection of independent 
(but not necessarily identical) regular value distributions to the problem of finding a near-optimal 
pricing for a collection of independent distributions, which are supported on a common finite interval 
[umin, Umax], where Umax/umin < 16n*/e^, where n is the number of distributions and e is the desired 
approximtion. It is important to notice that our bound on the ratio Umax/umin does not depend 
on the distributions at hand, just their number and the required approximation. We also emphasize 
that the input regular distributions may be supported on [0, -|-oo), so it is a priori not clear if we can 
truncate these distributions to any finite set (even of exponential imbalance) without losing revenue. 
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Theorem 13 (Reduction from Regular to Po/y(n)-Balanced Distributions). Let V = {^^i}ie[n.] a 
collection of mutually independent (but not necessarily identically distributed) regular random vari- 
ables. Then there exists some a = a(y) > such that, for any e G (0,1), there is a reduction from 
Price(V, e) to Price(V, e — Q{e/n)), where V = {"UilieM ^ collection of mutually independent 
random variables that are supported on [^r, ^"3° ]. 

Moreover, we can compute a in time polynomial in n and the size of the input (whether we have 
the distributions of the Vi 's explicitly, or have oracle access to them.) For all e, the reduction runs 
in time polynomial in n, 1/e and the size of the input. In particular, if we have oracle access to the 
distributions of the Vi 's, then the forward reduction produces oracles for the distributions of the Vi 's, 
which run in time polynomial in n, 1/ e, the input to the oracle and the desired oracle precision. 

Our reduction is based on the following extreme value theorem for regular distributions, proved 
in Appendix |G.1[ See Appendix G.2 for a discussion of what this theorem means. 



Theorem 14 (Homogenization of the Extreme Values of Regular Distributions). Let be a 

collection of mutually independent (but not necessarily identically distributed) regular random vari- 
ables, where n > 2. Then there exists some a = a{{Xi}i) such that: 

1. a has the following '^anchoring" properties: 

• for all l>l, Pr[Xi > la] < 2/{ln^), for all i £ [n]; 

• a/n^ < c • max2(z • Pr[maxj{Xj} > z]), where c is an absolute constant. 

2. for all e G (0, 1), the tails beyond can be "homogenized" , i.e. 

• for any integer m < n, thresholds ti, . . . ,tm ^ t > ^"2" 1 index set {ai, 



,ara} Q [n]: 



m / 



max{XaJ > t 



7e 




+ — • 




n 





max{XaJ > 



2a 



Finally, a is efficiently computable from the distributions of the Xi's (whether we are given the dis- 
tributions explicitly, or have oracle access to them.) 



Given our homogenization theorem, our reduction of Theorem 13 is obtained as follows. 

• First, we compute the threshold a specified in Theorem 14 This can be done efficiently as 
stated in Theorem 14 Now given the second anchoring property of a, we obtain an Q,{a/n^) 
lower bound to the optimal revenue. Such a lower bound is useful as it implies that we can 
ignore prices below some 0{ea/n^). 



Next, using our homogenization Theorem 14, we show that if we restrict a price vector to lie in 
[ea/n^, 2n^a/e^]"', we only lose a O(^) fraction of the optimal revenue. This step is detailed in 



Appendix G.3.1 



Finally, we show that we can efficiently transform the input regular random variables {t'i}je[n] 
into a new collection of random variables {vi}i,^[n] that are supported on ^^^] and satisfy 
the following: a near-optimal price vector for when the buyer's values are distributed as {vi}i£[n] 
can be efficiently transformed into a near-optimal price vector for when the buyer's values are 
distributed as {vi}i^[n]- This step is detailed in Appendix G.3.2, while Appendix G.3.3 concludes 



the proof of Theorem 13 



5 From Continuous to Discrete Distributions 



We argued that the expected revenue can be sensitive even to small perturbations of the prices and 
the probability distributions. So it is a priori not clear whether there is a coarse discretization of the 
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input and the search space that does not cost a lot of revenue. We show that, if done dehcately, there 
is in fact such coarse discretization. Our discretization result is summarized in Theorem I15[ Notice 
that the obtained discretization does not eliminate the exponentiality of the search or the input space. 

Theorem 15 (Price/Value Distribution Discretization). Let V = {fi}je[n] be a collection of mutually 
independent random variables supported on a finite set \umin,Umax] C M+, and let r = """"" > 1. For 

any e £ (ifk^^TljiTe) ; ^^e'^e is a reduction from Price(V, e) to RestrictedPrice(V, 0(e^)), 
where 

• V = {vi}i£[n] is a collection of mutually independent random variables that are supported on a 
common set of cardinality O ( 



\v\=o 



log r 



Moreover, assuming that the set [umimUmax] is specified in the input, ^we can compute the (com- 
mon) support of the distributions of the variables {vi}i as well as the set of prices V in time polynomial 
in logUmin, log Umax and 1/e. We can also compute the distributions of the variables {vi}i^[n] time 
polynomial in the size of the input and 1/e, if we have the distributions of the variables {i'i}i6[n] 
explicitly. If we have oracle access to the distributions of the variables {fi}jg[n], we can construct an 
oracle for the distributions of the variables {vi}i(^\n\, 

running in time polynomial in logWmm? log Wmax? 
1/e, the input to the oracle and the desired precision. 

That prices can be discretized follows immediately from a discretization lemma attributed to Nisan |5] 



(see also a related discretization in pjp,) and our result is summarized in Lemma 44 of Appendix H.l 
The discretization of the value distributions is inspired by Nisan's lemma, but requires an intricate 
twist in order to reduce the size of the support to be linear in log r rather than linear in log r which 
is what a straightforward modification of the lemma gives. (Indeed, quite some effort is needed to get 



the former bound.) The achieved discretization in the value distributions is summarized in Lemma 48 



of Appendix H.2 



6 Probabilistic Covers of Revenue Distributions 

Let V := {vi}i be an instance of Price, where the Vi^s are mutually independent random variables 
distributed on a finite set [Umin ) Umax 

] according to distributions {Fi}i, and let TZqpt be the optimal 
expected revenue for V. Our goal is to compute a price vector with expected revenue (1 — e)TZoPT- 
Theorem 15 of Section [s] provides an efficient reduction of this problem to the (1 — 6) approxima- 



tion of a discretized problem, where both the values and the prices come from discrete sets whose 
cardinality is 0(logr/(5^), where r = and S = O(e^). For convenience, we denote by {Fi}i the 
resulting discretized distributions, by {vi}i a collection of mutually independent random variables 
distributed according to the Fj's, by {v^^\ v^'^\ . . . , u^'^^^} the (common) support of all the Fj's, and 
by {p^^\p^'^\ . . . of available price levels, where both ki and k2 are 0(logr/(5^). It is 

worth noting that the set of prices satisfies minjp'^*)} > Umin/ (1 + S) and maxjp*^*)} < Umax, and that 
these prices are points of a geometric sequence of ratio 1/(1 — 6"^). (See Lemmas 41 and 44 in the 
Appendix.) 

Having discretized the support sets of values and prices, a natural idea that one would like to 
use to go forward is to further discretize the distributions {Fi}i by rounding the probabilities they 
assign to every point in their support to integer multiples of some fraction a = cr(e, r) > 0, i.e. a 
fraction that does not depend on n. If such discretization were feasible, the problem would be greatly 



^The requirement that the set [uminjUmax] is specified as part of the input is only relevant if we have oracle access 
to the distributions of the Vi's, as if we have them explicitly we can easily find [umin,Umax]- 
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simplified. For example, if additionally r were an absolute constant or a function of e only (as it 
happens for MHR distributions by virtue of Theorem 31), there would only be a constant number 
of possible value distributions (as both the cardinality of the support of the distributions and the 
number of available probability levels would be a function of e only.) In such case, we could try to 
develop an algorithm tailored to a constant number of available value distributions. This is still not 
easy to do (as we don't even know how to solve the i.i.d. case of our problem), but is definitely easier 
to dream of. Nevertheless, the approach breaks down as preserving the revenue while doing a coarse 
rounding of the probabilities appears difficult, and the best discretization we can obtain is given in 
Lemma 49 of Appendix |H.4[ where the accuracy is inverse polynomial in n. 

Given the apparent impasse towards eliminating the exponentiality from the input space of our 
problem, our solution evolves in a radically different direction. To explain our approach, let us view 
our problem in the graphical representation of Figure [T] of Appendix |B] Circuit C takes as input a 
price vector pi, . . . ,pn and outputs the distribution of the revenue of the seller under this price 

vector. Indeed, the revenue of the seller is a random variable Rp whose value depends on the variables 
{'^i}ie[n]- So in order to compute the distribution of the revenue the circuit also uses the distributions 
which are hard-wired into the circuit. Let us denote the expectation of Rp as TZp. 
Given our restriction of the prices to the finite set {p^^\p^'^\ . . . there are k"^ possible inputs 

to the circuit, and a corresponding fcg number of possible revenue distributions that the circuit can 
produce. Our main conceptual idea is this: instead of worrying about the set of inputs to circuit C, 
we focus on the revenue distribution directly, constructing a probabilistic cover (under an appropriate 
metric) of all the possible revenue distributions that can be output by the circuit. The two crucial 
properties of our cover are the following: (a) it has cardinality 0(nP°^^^E ''°^^^), and (b) for any 
possible revenue distribution that the circuit may output, there exists a revenue distribution in our 
cover with approximately the same expectation. 

Details of the Cover. At a high level, the way we construct our cover is via dynamic programming, 
whose steps are interleaved with coupling arguments pruning the size of the DP table before proceeding 
to the next step. Intuitively, our dynamic program sweeps the items from 1 through n, maintaining 
a cover of the revenue distributions produced by all possible pricings on a prefix of the items. More 
precisely, for each prefix of the items, our DP table keeps track of all possible feasible collections of 
ki X k2 probability values, where Prj^^jj) h G [^i]>*2 S [^2], denotes the probability that the item with 
the largest value-minus-price gap (i.e. the item of the prefix that would have been sold in a sale that 
only sales the prefix of items) has value u^*^^ for the buyer and is assigned price p^*^^ by the seller. I.e. 
we memoize all possible (winning-value, winning-price) distributions that can arise from each prefix 
of items. The reasons we decide to memoize these distributions are the following: 

• First, if we have these distributions, we can compute the expected revenue that the seller would 
obtain, if we restricted our sale to the prefix of items. 

• Second, when our dynamic program considers assigning a particular price to the next item, then 
having the (winning- value, winning-price) distribution on the prefix suffices to obtain the new 
(winning-value, winning-price) distribution that also includes the next item. I.e., if we know 
these distributions, we do not need to keep track of anything else in the history to keep going. 
Observe that it is crucial here to maintain the joint distribution of both the winning-value and 
the winning-price, rather than just the distribution of the winning-price. 

• In the end of the program, we can look at all feasible (winning- value, winning-price) distributions 
for the full set of items to find the one achieving the best revenue; we can then follow back- 
pointers stored in our DP table to uncover a price vector consistent with the optimal distribution. 

All this is both reasonable, and fun, but thus far we have achieved nothing in terms of reducing 
the number of distributions Fj^ in our cover. Indeed, there could be exponentially many (winning- 
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value, winning-price) distributions consistent with each prefix, so that the total number of distributions 
that we have to memoize in the course of the algorithm is exponentially large. To obtain a polynomially 
small cover we show that we can be coarse in our bookkeeping of the (winning-value, winning-price) 
distributions, without sacrificing much revenue. Indeed, it is exactly here where viewing our problem 
in the "upside-down" way illustrated in Figure [T] (i.e. targeting a cover of the output of circuit C 
rather than figuring out a sparse cover of the input) is important: we show that, as far as the expected 
revenue is concerned, we can discretize probabilities into multiples of ^^^^3 after each round of the DP 
without losing much revenue, and while keeping the size of the DP table from exploding. That the 
loss due to pruning the search space is not significant follows from a joint application of the coupling 
lemma and the optimal coupling theorem (see, e.g., |10j). after each step of the Dynamic Program. 



7 The Algorithm for the Discrete Problem 

In this section, we formalize our ideas from the previous section, providing our main algorithmic 
result. We assume that the pricing problem at hand is discrete: the value distributions are supported 
on a discrete set S = {v^^\v'^'^\ . . . , w'^'^i)}, and the sought after price vector also lies in a discrete set 
{^p^^\ ■ ■ ■ , where both S and V := {^p^^\ ■ ■ ■ are given explicitly as part of the input, 

while our access to the value distributions may still be either explicit or via an oracle. We denote by 
OPT the optimal expected revenue for this problem, when the prices are restricted to set V. 



The Algorithm. As a first step, we invoke Lemma 49 of Appendix II.4 , obtaining a polynomial- 
time reduction of our problem into a new one, where additionally the probabilities that the value 
distributions assign to each point in S is an integer multiple of l/(rn)'^, where r = max|^|^|. The 
loss in revenue from this reduction is at most an additive min{p(*)}. Moreover, the construction of 



Lemma 49 is explicit, so from now on we can assume that we know the value distributions explicitly. 
Let us denote by {Fi}i the rounded distributions and set m := rn throughout this section. 

The second phase of our algorithm is the Dynamic Program outlined in Section [6j We provide some 
further details on this next. Our program computes a Boolean function (7(i,Pr), whose arguments 
lie in the following range: i G [n] and Pr = (Pri^i, Pri^2; • • • > Prfci,fc2)) where each Pr^^^ij G [0,1] is 
an integer multiple of The function g is stored in a table that has one cell for every setting of 
i and Pr, and the cell contains a or a 1 depending on the value of g at the corresponding input. 
In the terminology of the previous section, argument i indexes the last item in a prefix of the items 
and Pr is a (winning- value, winning-price) distribution in multiples of If Pr can arise from some 
pricing of the items (up to discretization of probabilities into multiples of ^), we intend to 

store (7(1, Pr) = 1; otherwise we store g{i,VT) = 0. 

Due to lack of space we postpone the straightforward details of the Dynamic Program to Ap- 



pendix I.l Very briefiy, the table is filled in a bottom-up fashion from i = 1 through n. At the 
end of the i-th iteration, we have computed all feasible "discretized" (winning- value, winning-price) 
distributions for the prefix 1 . . . i, where "discretized" means that all probabilities have been rounded 
into multiples of 1/m^. For the next iteration, we try all possible prices p^^^ for item i + 1 and compute 
how each of the feasible discretized (winning- value, winning-price) distributions for the prefix 1 . . A 
evolves into a discretized distribution for the prefix 1 . . . i -|- 1 , setting the corresponding cell of layer 
g{i + 1, •) of the DP table to 1. Notice, in particular, that we lose accuracy in every step of the 
Dynamic Program^ as each step involves computing how a discretized distribution for items 1 . . .i 
evolves into a distribution for items 1 . . . i + 1 and then rounding the latter back again into multiples 
of 1 /m^ . We show in the analysis of our algorithm that the error accumulating from these roundings 
can be controlled via coupling arguments. 

After computing g^s table, we look at all cells such that g{n, Pr) = 1 and evaluate the expected 
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revenue resulting from the distribution Pr, i.e. 



n6[A:i],j2e[A:2] 

Having located the cell whose TZ^ is the largest, we follow back-pointers to obtain a price vector 
consistent with Pr. At some steps of the back-tracing, there may be multiple choices; we pick an 
arbitrary one to proceed. 

Running Time and Correctness. Next we bound the algorithm's running time and revenue. 

Lemma 16. Given an instance of RestrictedPrice, where the value distributions are supported 
on a discrete set S of cardinality ki and the prices are restricted to a discrete set V of cardinality k2, 
the algorithm described in this section produces a price vector with expected revenue at least 

(2kik2 ^ 16\ . 
\ [nry n J 

where OPT is the optimal expected revenue, m.m{V} is the lowest element ofV, and r is the ratio of 
the largest to the smallest element ofV. 

Lemma 17. The running time of the algorithm is polynomial in the size of the input and {nr)^^^'^^'^\ 

Due to space limitations, we postpone the proofs of these lemmas to Appendix |lj Intuitively, if we 
did not perform any rounding of distributions, our algorithm would have been exact, outputting an 
optimal price vector in we show is that the roundings performed at the steps 

of the dynamic program are fine enough that do not become detrimental to the revenue. To show 
this, we use the probabilistic concepts of total variation distance and coupling of random variables, 
invoking the coupling lemma and the optimal coupling theorem after each step of the algorithm. (See 



Lemma 50 in Appendix 1.2 ) This way, we show that the rounded (winning- value, winning-price) 



distributions maintained by the algorithm for each price vector are close in total variation distance 



to the corresponding exact distributions arising from these price vectors, culminating in Lemma 16 



Using Lemmas [16] and 17 and our work in previous sections, we obtain our main algorithmic 
results in this paper (Theorems [l| [2| [sj and [4]). See Appendix [j] for the proof of these theorems. 
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Appendix 



A An interesting example 

A natural property than one may expect to hold is that, when the value distributions are discrete, 
there always exists an optimal solution that uses prices from the support of the value distributions. 
It turns out that this is not true. Here is an example: 

Suppose that the seller has two items to sell, and the buyer's values for the items are vi, which 
is uniform on {1,5}, and V2, which is uniform on {3,3.5}. Moreover, assume that, if there is a tie 
between the value-minus-price gap for the two items, the buyer tie- breaks in favor of item 1. We claim 
that in this case the price vector P = (4.5, 3) achieves higher revenue than any price vector that uses 
prices from the set {1, 3, 3.5, 5} (where the values are drawn from.) Let us do the calculation. All our 
calculations are written in the form 

Ttp = Pi X Pr[item 1 is the winner] +P2 x Pr[item 2 is the winner]. 

1. When P = (4.5,3) 

Up = 4.5 X (1/2 X 1) + 3 X (1/2 x 1) = 30/8 

2. When P G {1, 3, 3.5, 5}^: 

• If P = (5, 3.5) then 

Tep = 5 X (1/2 X 1) + 3.5 X (1/2 X 1/2) = 27/8 < 30/8 

• If P = (5,3) then 

7^p = 5 X (1/2 X 1/2) + 3 X (1 X 1/2 + 1/2 X 1/2) = 28/8 < 30/8 

• For any other price vector, the maximum revenue is bounded by 3.5 = 28/8 < 30/8. 

B Figures 



Fi F2 Fa 
I I I I 




Figure 1: The Revenue Distribution. 



C Access to Value Distributions 

In this paper, we consider two ways that a distribution may be input to an algorithm. 

• Explicitly: In this case the distribution has to be discrete, and we are given its support as a 
list of numbers, and the probability that the distribution places on every point in the support. 
If a distribution is provided explicitly to an algorithm, the algorithm is said to be efficient, if 
it runs in time polynomial the description complexity of the numbers required to specify the 
distribution. 
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• As an Oracle: In this case, we are given an oracle that answers queries about the value of the 
cumulative distribution function on a queried point. In particular, a query to the oracle consists 
of a point x and a precision e, and the oracle outputs a value of bit complexity polynomial in the 
description of x and e, which is within e from the value of the cumulative distribution function 
at point X. Moreover, we assume that we are given an anchoring point x* such that the value 
of the cumulative distribution at that point is between two a priori known absolute constants 
ci and C2, such that < ci < C2 < 1. Having such a point is necessary, as otherwise it would be 
impossible to find any interesting point in the support of the distribution (i.e. any point where 
the cumulative is different than or 1). 

If a distribution is provided to an algorithm as an oracle, the algorithm is said to be efficient, 
if it runs in time polynomial in its other inputs and the bit complexity of x* , ignoring the time 
spent by the oracle to answer queries (since this is not under the algorithm's control). 

If we have a closed form formula for our input distribution, e.g. if our distribution is M{fJ,, o"^), 
we think of it as given to us as an oracle, answering queries of the form (x, e) as specified above. 
In most common cases, such an oracle can be implemented so that it also runs efficiently in the 
description of the query. 



D Properties of MHR Distributions 

Definition 18. For a random variable X , we define ai = Umin, cind for every real number p E 
(1, +oo), we define Op = inf |x|-F(x) > 1 — ^| . 

The following lemma establishes an interesting property of MHR distributions. Intuitively, the 
lemma provides a lower bound on the speed of the decay of the tail of a MHR distribution. We prove 
the lemma by showing that the function log (l — F[x)^ is concave if F is MHR, and exploiting this 
concavity (see Appendix D.l). 

Lemma 19. If the distribution of a random variable X satisfies MHR, m> 1 andd > 1, d-am > «m'*- 

Next we study the expectation of a random variable that satisfies MHR. We show that the con- 
tribution to the expectation from values > m, is 0{m ■ Pr[X > m]). We start with a definition. 

Definition 20. For a random variable X , let Con[X > x] = K[X\X > x] ■ Pr{A > x} be the 
contribution to expectation of X from values which are no smaller than x, i.e. 



r+oo 

Con[X > x] = X ■ f{x)dx. 

J X 



It is an obvious fact that for any random variable X and any two points xi < X2, Con[X > xi] > 
Con[X > X2]. Using the bound on the tail of a MHR distribution obtained in Lemma [Toj we bound 
the contribution to the expectation of X by the values at the tail of the distribution. The proof is 
given in Appendix [D| 

Lemma 21. Let X be a random variable whose distribution satisfies MHR. For all m>2, Con[X > 
ot-m] < 6am /m. 



D.l Prooofs 



Proof of Lemma 19: It is not hard to see that f{x) > 0, for all x G {umin, Umax)- For a contradiction, 
assume this is not true, that is, for some x' G {umin,Umax), fi^') = 0. We know 1 — F{x') > 0. 
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Thus ^Ip^lr-j = 0. Since the distribution satisfies MHR and 1 — F{x) is positive for all x G {umin, x'), 
f{x) = in this interval. Hence, it must also be that F{x) = in [umin-,x'). Since x' > Umin, it 
follows that Umin 7^ sup{x\F{x) = 0}, a contradiction. 

Since f{x) > in {umin,Umax), is monotone in {umin,Umax)- So we can define the inverse 

F~^{x) in {umimUmax)- It IS not hard to see that for any p G [l,+oo), F{ap) = 1 — 1/p and 
ap = F-^l-l/p). 

Now let G{x) = log(l — F{x)). We will show that G{x) is a concave function. 

Let us consider the derivative of G{x). By the definition of MHR, G'{x) = ^Jp^-^ is monotonically 
non-increasing. Therefore, G{x) is concave. It follows that, for every m, by the concavity of 
the following inequality holds: 

„(d-l 1 \^ d-l^, , 1^, , 

G — — • ai + - • a^d > — — G(ai) + -G(a^d). 



^ d ^ d "^7 - d d 

Let us rewrite the RHS as follows 

d-l 1 
— — G(aij + -G(a^d 

d-l, 1 



logl + -log(l-F(Q„d)) 



d ° d 

a \m 
1 

m 



■log 



Hence, we have the following: 

^log (l - F . ai + i . a„d)) > log 

^, , ^/d-l 1 

=^F{am) ^ F I —J— • "1 + ^ ■ «mrf 

d-^ 1 

=^am > — — ' '^^ ~^ d' ^'^ monotone increasing) 

1 , . 

^=^ci • am > «m<^- 

□ 

Proof of Lemma 21: Let = Gon[X > a™], and consider the sequence {Pi := a (2*)}, defined for all 
non-negative integers i. It can easily be seen that limj_^+oo CKm(2») = Umax] hence, limj_i._(_oo ft = limax 
and by continuity hmj^+oo F{Pi) = F{umax) = 1- 
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Also, 



X ■ f{x)dx < A+i(l - F(ft)) = A+i/m(20. 



ft 



Moreover, Lemma 19 implies that /3j < 2/3j_i; thus, /3j < 2*/3o < 2*am- Hence, we have the following: 

+00 



X 



1=0 1=0 



m 



i=l 



m 



2a. 



+ 



m 



+00 

E 

1=0 



m 



+ 



1 



m? 1 — 2/m? 
^ 2Qf^ ^ 4:am 



< 



m 

.GOrr. 

m 



m 



□ 



E Properties of Regular Distributions 

If F is a differentiable continuous regular distribution, it is not hard to see the following: if /(x) = 
for some x, then f{x') = for all x' > x (as otherwise the definition of regularity would be 
violated.) Hence, if X is a random variable distributed according to F, it must be that f{x) > 
for x G [u^in^'^maxl- So we can define on [itmm! ■"max]) it will be differentiable, since F is 
differentiable and / is non-zero. Now we can make the following definition, capturing the revenue 
of a seller who prices an item with value distribution F, so that the item is bought with probability 
exactly q. 

Definition 22 (Revenue Curve). For a differentiable continuous regular distribution F, define Rp '■ 
[0, 1] — > M as follows 

RF{q)=q-F-\l-q). 
The following is well-known. We include its short proof for completeness. 
Lemma 23. If F is regular, Rf{<i) is a concave function on (0, 1]. 
Proof. The derivative Rpiq) is 

Notice that F~^{1 — q) is monotonically non-increasing in q. This observation and the regularity of 
F imply that R'p{q) is monotonically non-increasing in q. (To see this try the change of variable 
x{q) = F~^{1 — q).) This implies that Rf{q) is concave. □ 

Lemma 24. For any regular distribution F, ifO<q<q<p<l, then 

Rpiq) < -^Rpiq). 
1-p 
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Proof. Since q G [q, 1), there exists a A G (0, 1], such that 



X-q+{l-X)-l = q. 

Hence: A = ]^ > = 1 — p. Now, from Lemma 23, we have that Rf{x) is concave. Thus 



Rpiq) = Rf{X • g+ (1 - A) • 1) > a • Rpiq) + (1 - A) • Rf{1). 
Since Rf{1) > 0, Rpiq) > A • Rpiq) > (1 - p)RF{q)- Thus, Rpiq) < j^Rriq)- □ 
Corollary 25. For any regular distribution F, if q < q < i/ien 

Rpiq) < ^^Rriq)- 
n'^ — 1 

F Details of Section [3]: MHR to Balanced Distributions 



F.l Proof of Theorem 12 (the Extreme Value Theorem for MHR Distributions) 



We start with some useful notation. For all i = 1, . . . , n, we denote by Fi the distribution of variable 
Xi. We also let am '■= inf {x\Fi{x) > 1 — for all m > 1. Moreover, we assume that n is a power 
of 2. If not, we can always include at most n additional random variables that are detreministically 
0, making the total number of variables a power of 2. 



We proceed with the proof of Theorem 12 The threshold /3 is computed by an algorithm. At a 
high level, the algorithm proceeds in O(logn) rounds, indexed by t G {0, . . . ,logn}, eliminating half 
of the variables at each round. The way the elimination works is as follows. In round t, we compute 
for each of the variables that have survived so far the threshold Q!„/2t beyond which the size of the 
tail of their distribution becomes smaller than l/(n/2*). We then sort these thresholds and eliminate 
the bottom half of the variables, recording the threshold of the last variable that survived this round. 
The maximum of these records among the logn rounds of the algorithm is our f3. The pseudocode 
of the algorithm is given below. Given that we may only be given oracle access to the distributions 
{-Fi}jg[„], we allow some slack r/ < ^ in the computation of our thresholds so that the computation 
is efficient. If we know the distributions explicitly, the description of the algorithm simplifies to the 
case r/ = 0. 

Algorithm 1 Algorithm for finding /3 
1: Define the permutation of the variables TTo{i) = i, \/ i G [n], and the set of remaining variables 

Qo = [n]. 
2: for t := to log n — 1 do 

3: ForalljG [n/2*], compute some ^ [I — rj, 1 + 7]]- a^^l'^P\ for a small constant r] £ [0,1/2) 

4: Sort these n/2* numbers in decreasing order nt+i such that 

(7rt+i(l)) ^(7rt+i(2)) (7rt+i(n/2')) 



5: Qt+1 ■■= { TTt+iii) I i < n/2*+i } 

Pi ■^n/2' 

end for 



6: 
7: 
8: 

9: Set /3iogn '■— ^2 ^ 

10: Output j3 := maxj /3j 



Compute g [i _ ^, i + ^] . a^"'-"^')^ 



Crucial in the proof of the theorem is the following lemma. 
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Lemma 26. For all i G [n] and e G (0, 1), let Si = Con[Xi > 21og(^) • /?], where Con[-] is defined as 
in Definition 20. Then 

n 

^Si< 361og(l/e)e-/3, for all e e (0,1/4). 

1=1 

Proof. Let d = log(^) and notice that d> 2. It is not hard to see that we can divide [n] into (logn) + l 
different groups {Gt}te{o....,\ogn} based on the sets Qt maintained by the algorithm, as follows. For 
t £ {0, . . . , logn}, set 

^ _ iQt \ Qt+i t < logn 
[Qlogn t = \ogn 

Now, it is not hard to see that, for all t < logn and all i £ Gt, Si < Con[Xi > 2d ■ f3t], since 
Pt < /3. Also for any i G Gt, there must exist some k G (n/2*"'"-'^, n/2*], such that i = TTt+i{k). Then 
by the definition of the algorithm, we know that 



Recall that rj is chosen to satisfy 2 > 1/(1 — 77). Then d ■ 0:^*^2' — ^'^ " Z^*- Lemma 
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gives 



2o! • A > d • a^st >a(l^/2*)^' 



which implies that 

Con[v^ > 2d ■ (3t] < Con[v, > af^^^,^,] 



Using Lemma 21 , we know that 

Con[v^ > < 6ag/2'),(2Vn)'^ < 12d(3t{2' /nf. 

Now, since \Gt\ = n/2*+\ 



J2 Si < l2dpt{2'/nY X n/2*+i = M ■ pt{2'/nf-^ = ^^^{2'^-^)\ 



Thus, 

(logn)-l 



ie[n]\Giog„ t=0 

^6d-^ (2'='-l)l°gn _ 1 



6d ■ 13 n'^-^ - 1 



< 



< 



^d-i 2'^-! - 1 
I2d-P 
2'^ -2 
24d-P 

241og(l/e)e-/3 
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Let i be the unique element in Giogn- Then /3iog„ = X2 . Using Lemma 
X2 ^ , we obtain 
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and the definition of 



2d • /3 > 2d • Aogn > 2d ■ x*-^ > 2(1 - v)d ■ > d ■ af > a^^l = 



A- 



Using the above and Lemma 21 we get 



Si < Con[vi > af,^] < 6e • aj! < I2ed ■ 13. 



Putting everything together, 



<361og(l/e)e-/3. 



4 = 1 



□ 



Using Lemma 26, we obtain 

r"+00 



r+oo " 

/ t ■ f^,,^{xMt)dt <^Si< 361og(l/e)e • f3. 

J2l3logl/e 



It remains to show that 



Pr[max{Xi} > /3/2] > 1 - — 





/2- 



(3) 



We show that, for all Pr 



maxjjXj} > > 1 — 71721 where r/ is the parameter used in Algo- 



rithm 1. This is sufficient to imply (|3|, as 77 < 1/2. Observe that for all i £ [n/2*+i], 



(7rt+i(j)) ^ jTrt+i{i)) 



>/3t, 



where irt+i is the permutation constructed in the t-th round of the algorithm. This implies 
Hence, for ah i G [n/2*+i], Pr[X^^_^^(i) < ^] < 1 - 2Vn. Thus, 



Pr 



max{Xj} > 



I3t 



1+r] 



> Pr 



3i e [n/2*+^], > ^ 

> 1 _ (1 _ 27n)"/2*+' 



+ r/ 



> 1 



=1/2 • 



Eq. ^ now follows. 



F.2 Proof of Theorem 11 (the Reduction from MHR to Balanced Distributions) 

Recall that we represent by {fi}ie[n] the values of the buyer for the items. We will denote their 
distributions by {-Fj}ie[n] throughout this section. 
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F.2.1 Relating OPT to f3 



We demonstrate that the anchoring point /3 of Theorem 12 provides a lower bound to the optimal 
revenue. In particular, we show that the optimal revenue satisfies OPT = This lemma justifies 

the relevance of /3. 



Lemma 27. // f3 is the anchoring point of Theorem 



12. then OPT > ( 1 - 4 



vector would be at 



Proof of Lemma 21: Suppose we priced all items at f . The revenue we would get from such price 



east 



/3 



Pr 



maxjwj} > 



/3' 



-21 a/^ 



where we used Theorem 



12 



Hence, OPT > 1 



For simplicity, we set ci '■= \ ~ j for the next sections, keeping in mind that ci is an absolute 
constant. 



F.2.2 Restricting the Prices 



This section culminates in Lemma 30 (given below), which states that we can constrain our prices 
to the set [e • /3,21og(^) • /3] without hurting the revenue by more than a fraction of where 
C2(e) := 361og(^)e and ci is the constant defined in Section F.2.1 We prove this in two steps. First, 
exploiting our extreme value theorem for MHR distributions (Theorem 12), we show that for a given 
price vector, if we lower the prices that are above 21og(^) • /3 to 21og(-) • /3, the loss in revenue is 
bounded by C2(e) • /3, namely 

Lemma 28. Given any price vector P we define P' as follows: we set p'- = pi, if pi < 21og(^) • ji, 
and p'.i = 21og(^) • (5 otherwise, where e G (0, 1/4). Then the expected revenues IZp and IZpi achieved 
by price vectors P and P' respectively satisfy: TZp' > TZp — C2{e) ■ /3 . 



We complement Lemma 28 with the following lemma, establishing that, for any given price vector P, 
if we increase all prices below e • /3 to e • the loss in revenue is at most e • /?. 

Lemma 29. Let P be any price vector and define P' as follows: set p'^ = pi, if pi > a, and p\ = a 
otherwise. The expected revenues TZp and IZpi from these price vectors satisfy IZpi > TZp — a. 



Combining these lemmas, we obtain Lemma [30} Observe that we can make the loss in revenue 
arbitrarily small be taking e sufficiently small. 

Lemma 30. For all e G (0, 1/4), there exists a price vector P* G [e • /3,21og(^) • /?]"", such that the 



revenue from this price vector satisfies TZp* > 
under any price vector. 



OPT, where OPT is the optimal revenue 



All proofs of this section are in Appendix |F.3.1 



F.2.3 Truncating the Value Distributions 

i.e. that we can constrain the prices to [e • /?, 21og(-) • /?] without hurting the 



30 



Exploiting Lemma 

revenue, we show Theorem |11[ i.e. that we can also constrain the support of the value distributions 



into a balanced range. In particular, we show that we can "truncate" the value distributions to the 
range [| • /3,21og(^) • /3], where for our purposes "truncating" means this: for every distribution Fj, 
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we shift all probability mass from (2 log(^) • (3, +00) to the point 2 log(^) • /3, and all probability mass 
from (— 00, e-/3) to § •/?• Clearly, this modification can be computed in polynomial time if Fi is known 
explicitly; if we have oracle access to Fi, we can produce in polynomial time another oracle whose 
output behaves according to the modified distribution. We show that our modification does not hurt 
the revenue. That is, we establish a polynomial-time reduction from the problem of computing a 
near-optimal price vector when the buyer's value distributions are arbitrary MHR distributions to the 
case where the buyer's value distributions are supported on a balanced interval [umin, c • Umin], where 
c = c(e) = 4^ log(^) is a constant that only depends on the desired approximation e. The proof of 
Theorem [31] is given in Appendix |F.3.2[ 

Theorem 31 (Reduction from MHR to Balanced Distributions). Given e G (0, 1/4) and a collection 
of mutually independent random variables {vi}i that are MHR, let us define a new collection of 
random variables {vi}i via the following coupling: for all i G [n], set Vi = ^ ■ f3 if Vi < e ■ P, set 
Vi = 21og(-) • /3 if Vi > 21og(-) • /?, and set Vi = Vi otherwise, where /3 = P{{vi}i) is the anchoring 



point of Theorem 12 computed from the distributions of the variables {vi]i. Let also OPT be the 
optimal revenue of the seller when the buyer's values are distributed as {^i}iG[n] o.''^^ OPT the optimal 
revenue when the buyer's values are distributed as {i'i}ie[n]- Then given a price vector that achieves 

revenue {1 — 5) ■ OPT when the buyer's values are distributed as {t'ljitEH' ''^^ efficientlly compute 
a price vector with revenue 

Cl 

when the buyer's values are distributed as {fj}jg[n]- 
Theorem [11] follows from Theorem! 



F.3 Proofs Omitted from Section IF. 21 

F.3.1 Restricting the Price Range for MHR Distributions: the Proofs 



Proof of Lemma 28: We first show that, given a price vector, if we make all prices that are above a 
equal to -|-oo, then the loss in revenue can bounded by the sum, overall items whose price was turned 
into -|-oo, of the contribution to this item's expected value by points above a. Formally, 



Lemma 32. Let a > and S{a) = Con[maxiVi > a], where Con[-] is defined as in Definition 20. 
Moreover, for a given price vector P, define P' as follows: set p[ = pi, if Pi < a, and p[ = +co, 
otherwise. Then the expected revenues IZp and TZpi from P and P' respectively satisfy 

TZp' >np- S[a). 

Proof. Let tTj = Pr[ Vj : Vi - pi > vj - pj Avi - pi > 0], vr- = Pr[Vi : Vi - p[ > vj - p'j Avi-p^>0] 
denote respectively the probability that item i is bought under price vectors P and P' . 

We partition the items into two sets, expensive and cheap, based on P: Sexp = {i\Pi>(x} and 
Schp = { i \ Pi<a}- If « e Sexp, p'i = +00, while if i £ Schp, p'i = Pi- 

For any i G Schp, T^i < "^i- Indeed, for any valuation vector (f 1, ^2; ■ ■ ■ , Vn)'- if Vi — pi > vj — pj, for 
all j, and Vi — Pi > 0, it should also be true that Vi — p[ > Vj — p'j, for all j, and Vi — p'^ >0 (since we 
only increased the prices of items different from i). Therefore, the following inequality holds: 

^ P^■7^i< Yl • <■ 

Notice that, for any i G Sexp, whenever the event Ci = : Vi — pi > Vj — pj) A {vi — Pi > 0) 
happens, Vi has to be larger than pi, which itself is larger than a. Hence, whenever the event Ui^Sexp^i 

21 



happens, it must be that maxjjvj} > a. Given that maxj{fj} is the best revenue we could hope to 
extract (pointwise for ah vectors (ui, . . . it follows that 



on 



max{t;j} > a 



S{a). 



Combining this with (1), we obtain 



7^p, >np- S{a). 



□ 



We showed that setting the price of expensive items to +00 is not detrimental to the revenue. We 
show that the same is true of a less aggressive strategy. 

Lemma 33. Let P he a price vector, define Soo = {i : pi = +00}, and let Pmax > ^^^i&[n]\SacPi- 
Let P' he the following price vector: set jl^ — Pmax; 

for all i £ Soo, CLud p'^ = pi otherwise. Then the 
expected revenues TZp and TZpi from P and P' respectively satisfy 

TZpi > TZp. 

Proof. Take any valuation vector {vi,V2, ■ ■ . ,Vn) and suppose that i is the winner under price vector 
P, i.e. Vi — Pi > Vj — pj, for all j, and vi — pi> 0. Under P', there are two possibilities: (1) i is still 
the winner; then the contribution to the revenue is the same under P and P' . (2) i is not the winner; 
in this case, the winner should be among those items whose price was lowered from P to P' . So the 
new winner must some j £ Soo. But notice that p'j = Pmax > Pi- Hence, the contribution of item j 
to revenue under price vector P' is not smaller than the contribution of item i to the revenue under 
price vector P. Hence, TZpi >lZp. □ 



Combining Theorem [12] with Lemmas 32 and 33 it is easy to argue that if we truncate a price 
vector P at value 21og(^) • (5 to obtain a new price vector P' the change in revenue can be bounded 
as follows: 

Tip, >np- C2(e) • 



□ 



Pr[(Vj : Vi 



Pi > Vj - Pj) A {vi - Pi > 0)] and vr- = Pr[(Vj : Vi - p[ > 
= {i '■ Pi < ct}- For all i £ [n] \ Siow, we claim that 



Proof of Lemma 29: Let VTj 
Vj - P'j) ^ i^i - P'i > 0)]) and define Su 
T^'i ^ TTj. Indeed, since we only increased the price of items other than i, for any valuation vector 
V = {vi,V2, . . ■ ,Vn), if Vi — Pi > Vj — Pj, for all j, and Vi — pi > 0, it should also be true that 
Vi — p\ > Vj — p'j, for all j, and Vi — p[ > 0. 
Moreover, it is easy to see that 



Pi • TTj < a. 



Hence, we have the following inequality: 



7^r. 



> ^ Pi- A > Pi-Tri>'Rp 



a. 



i:[n]\Siou 



i:[n]\Siou 



□ 



Proof of Lemma 30: Lemma [28] implies that, if we start from any price vector P, we can modify it into 
another price vector P' that does not use any price above 2 log(^) - /S, and satisfies TZpi >TZp — C2{e)-(3. 
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Then Lemma 



7^P" >TZp> -e- 13. 
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implies that we can change P' into another vector P" G [e • 2 log(-) • /3]", such that 



By Lemma 27, we know that OPT > ci • /3. Hence, if we start with the optimal price vector P 
and apply the above transformations, we will obtain a price vector P* G [e • /?, 2 log(i) • such that 



Tip, > OPT - {e + C2{e)) ■ /3> 1 



e + C2(e] 
ci 



OPT. 



□ 



F.3.2 Bounding the Support of the Distributions: the Proofs 



To establish Theorem 31 we show that we can transform {vi}jg[n] into {^iljgfn] such that, for all i, Vi 
only takes values in [| • 21og(^) • and for any price vector P G [e • /3, 21og(^) • /?]"■, \TZp — Tlp\ < 
C2(e) • (3, where TZp and TZp are respectively the revenues of the seller when the buyer's values are 
distributed as {wi}ie[n] and {'Uj}ig[n]- We first show that one side of our truncation works. 

Lemma 34. Given e G (0,1/4) and a collection of random variables {vi}i that are MHR, let us 
define a new collection of random variables {vi}i via the following coupling: for all i G [n], if Vi < 
21og(i) • (3, set Vi = Vi, otherwise set Vi = 21og(-) • (3, where (3 = (3{{vi}i) is the anchoring point 



of Theorem 12 computed from the distributions of the variables {vi}i. Then, for any price vector 
P € [e • /3, 21og(^) • /5]", \TZp — TZp] < C2(e) • /3, where TZp and IZp are respectively the revenues of 
seller when the buyer's values are distributed as {fi}tG[n] ^''^^ as {i't}«G[n]- 

Proof. For convenience let d = log(^). Recall that {vi}i and {vi}i are defined via a coupling. We 
distinguish two events: (1) v := (f i, ^2; • • ■ > Vn) = (wi, £"2, • • . , Vn) =■ v; (2) Vi / Vi, for some i. 

Under Event (1), the item sold is the same under v and v. Hence, the revenue is the same under 
Event (1). So we only need to worry about Event (2). We show that the total contribution to the 
revenue from this event is very small. Indeed, the probability of this event is Pv\3i,Vi > 2d ■ P], and 
the maximum price is 2d ■ (3. So using our extreme value theorem (Theorem |12|), we can bound the 
contribution to the revenue from this event as follows. 



2d-i3- Pr[3i, Vi > 2d ■ /3] = 2d ■ 13 ■ Pr[maxui > 2d ■ (3] < Con[maxvi >2d- 13]< C2(e) • 13. 

i i 

We obtain |7^p - 1Zp\ < C2(e) • /3. □ 

Next we show that the other side of the truncation works. 

Lemma 35. Given e, f3 > and a collection of random variables {vi}i, let us define a new collection of 
random variables {vi}i via the following coupling: for all i G [n], if Vi > e - f3 , set Vi = Vi, otherwise set 
= I • /3. Then, for any price vector P £ [e • /3, +oo)", TZp = TZp, where TZp and TZp are respectively 
the revenues of the seller when the buyer's values are distributed as {vi}i(^[n] o-nd as {^j}jg[n]- 

Proof. Recall that {vi}i and {vi}i are defined via a coupling. For any value of -u = (wi, £"2, . . . , Vn), we 
distinguish the following cases: (1) v = {vi,V2, • . . , Vn) is exactly the same as v; (2) there exists some 
i such that Vi ^ Vi. 

In the first case, it is clear that the revenue of the seller is the same under values v and v. In the 
second case, we distinguish two further subcases: 

• if there is some item i* sold under values v, we show that i* is also sold under values v. Indeed, 
since i* is the winner under v, we have that, for all i, 



Vi* -pi* > Vi -pi] 
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and 

Vi* - Pi* > 0. 

Given that pi* > e • /?, it must be that Vi* > e • /3. Therefore, Vi* = Vi*. Hence, for any i such 
that Vi = -Oj, Vi* — Pi* > Vi — Pi] and, for any i such that Vi ^ Vi, Vi = ^ ■ j3 < pi. Thus, 
Vi* — Pi* ^ ^ > Vi — Pi. So i* is also the winner under v. 
• if there is no winner under that is Vi — Pi < 0, for all i, then: for any i such that Vi = Vi, 
Vi — Pi < 0, and for any i such that Vi ^ Vi, Vi = ^ ■ (3 < pi, as we argued above. So, Vi — Pi < 0, 
for all i. 

Hence, we have argued the revenues under v and v are the same in the second case too. Therefore, 



7^p = Tip. 



Putting these lemmas together we obtain our reduction. 



□ 



Proof of Theorem 31 : Let P be a near-optimal price vector when the values of the buyer are distributed 
as {^i}jG[n]) one that satisfies 

Up > {I -6) -OPT, 

where TZp denotes the expected revenue of the seller under price vector P when the buyer's values 
are {vi}i^[n]- Given that each Vi lies in [| • /3,21og(^) ■ (3], it follows from Lemma 
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that we can 



(efficiently) transform P into another vector P' G [| • f3, 21og(-) • /3]", such that TZp < TZp/. 

We can then apply the following efficient transformation to P' , to get P": For any i, ii p[ < e ■ 13, 



set p'l = e - /3, and set p'l = p[ otherwise. By Lemma 29, we know that 

TZp" >np> -e-l3. 
Now, since P" is a price vector in [e • (3, 21og(^) • /?]", by Lemmas 

7^p// >np^' -C2(e) -13, 



34 



and 



35 



we get 



where TZp" is the expected revenue of the seller under price vector P" when the values of the buyers 
are {vi}i. 



{V'i}i(^[n]- By Lemma 
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we know that 7^p* > ( 1 - ) OPT. Now Lemmas 







" for values 


34 


and 


35 


give 



TZp* > Up* - C2(e) • /3 > ( 1 - ^-±^) OPT - C2(e) ■ (3 > ( 1 - ilM^l ) qPT, 



ci 



Cl 



where we used that OPT > ci • /3, by Lemma 27 
Since OPT >1Zp*, 

Up, > 7^P > (1 - 6)0PT >{l-6)(l- i±M!) ^ OPT >(l-6- '^^''^^^^ ) OPT. 



Cl 



Cl 



Recall that TZp/t > IZpn — C2(e) • (3 > TZp/ — e ■ (3 — C2(e) • (3. Therefore, 



Up. >(l-5- '^ ) OPT - e • /3 - C2(6) • /3 > (^1 - <5 - + J^^^^^^ ) OPT. 

So given a near-optimal price vector P for we can construct a near-optimal price vector 

P" for {wj}jgr„i in polynomial time. □ 
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G Details of Section [4] 

G.l Proof of Theorem [14] (the Extreme Value Theorem for Regular Distributions) 

We define a explicitly from the distributions {-Fj}i of the variables {Xi}i. We first need a definition. 

Definition 36. A point x is a {ci, C2) -anchoring point of a distribution F, if F{x) G [ci,C2]. 

Now fix two arbitrary constants < ci < C2 < |, and let, for all i, ai be a (ci, C2)-anchoring point 
of the distribution Then define 



n 

a = — • max 

Cl i 



Clearly, a collection ai, . . . ,an of (ci, C2)- anchoring points can be computed efficiently from the Fj's 
(whether we have these distributions explicitly or have oracle access to them.) Hence, an a as above 
can be computed efficiently. We proceed to establish anchoring properties satisfied by a. 



Proposition 37. a > maxjaj^^g, where Qp^ = inf |x|i<i(2;) > 1 — ^^os in Definition 
Proof. Because l/n^<l — C2<1 — F[ai) < 1 — ci, it follows from Lemma 



18 



24 



that 



a,: 



(1 - Fiiai)) > a';^l/n 



Cl 



Hence: a > 



Oi- {l- Fi{ai)) 



> a L This is true for all i, hence the theorem. 



□ 



Proof of Theorem 14- We first show that Pr[Xj > £a] < 2/ {in''), for any £ > 1. By Proposition 37 
and Corollary 25, we have that (£a)Pr[Xi > ia] < J^aFv[Xi > a]. Thus 

.3 



Pr[X, > ia] < 



1 



1 i 



■ Pr[Xi > a] < 2/{£n-'), 



(4) 



which establishes the first anchoring property satisfied by a. 
Moreover, we have that 



1 

Cl 



• max 



Oj • (1 — Fi(ai)) < — max(2; • Pr[max{Xj} > z]), 



which establishes the second anchoring property of a. 

Finally, we demonstrate the homogenization property of a. We want to show that, for any integer 



m < n, thresholds ti, . . . ,tm > t > 



2r?a 



index set S = {ai, . . . ,am} C [n], and e E (0, 1): 



i=i V ^ / 



max{Xa-} > t 



+ 



7e • (2a/e • Pr[maxi{X„J > 2a/e]) 



n 



(s) 

For notational simplicity, we define fi{zi) = Zi ■ Vi[Xa^ > Zi] and fmax{z) = z ■ Pr[maxj{Xa-} > z\. 
Notice that for any ti > t > 2a/e, a double application of Proposition 37 Lemma 24 and Equation Q 
gives 



(n3/e) - 1" 



(nVe) - r V e J 



(5) 
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Thus, 



m ^ m 

LHs<Y^m + j-^j—Y,m) 

i=i \ I ) 



1=1 

m 



< 



1=1 

771 



(nVe) ■ 
2n 



i=l 

fiS) 
I max 



1=1 
m 



t) 
t) 



i=i ^ ^ 



On the other hand, for any t > 2a/e: Pr[Xa^ >t]< Fr[Xa.^ > 2a/e] < e/n^ (using Q). Thus: 

Pr[Xa, >t]> Pr[max{X,J > t] > (l - ejr?) J] Pr[X,, > t], (6) 

% i 

where the last inequahty fohows from the fact that, for ah i, the probabihty that X^. > t, while 
Xa^ < t for ah j e S\ {i} is at least Pr[Xa, > t]{l - e/n^)™"! > Pr[Xa, > t](l - e/n^f. Therefore, 
continuing our upper-bounding from above: 

"1 n / n \ 



i=l 






<it- 


2a/e) 


Pr[max{X„,J > t] 

i 


<it- 


2a/e) 


Pr[max{X„,J > t] 

i 


<{t- 


2a/e) 


Pr[max{X„J > t] 

i 


<{t- 


2a/e) 


Pr[max{X^J > t] 



i=l 
m 



i=l 



2a\ 26 (,) (2_a\ 



6e 



„ J max , 

n \ e 



2a 



where we got the third inequality by invoking (jsj and ([6]), the fourth inequality by invoking ^ with 
t = 2a/e, and the fifth inequality by invoking ([5j) and then ([g]) with t = 2a/ e. This concludes the 
proof of Theorem 14 □ 



G.2 Discussion of Theorem 1141 



In this section we play around with Theorem 14 to gain some intuition about its meaning: 

• Suppose that we set all the tj's equal to t > 2v?al^ . In this case, the homogenization property 
of Theorem 14 essentially states that the union bound is tight for t large enough. Indeed: 



Pr 



max{Xa.} > t 



< [J2Pr[X,^>t] 



.4 = 1 



2a 



< 



< Pr 



Pr 



max{Xa.} > t 



max{Xa.} > t 
7e /2a 



7e /2a ^ 
+ - • — • Pr 

tn \ e 



max{Xa.} > 



2a 



+ 



tn 



•Pr 



max{Xa.} > 



2a 
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This is not surprising, since for all i, the event Xa^ > t only happens with tiny probability, by 
the anchoring property of a. 

Now let's try to set all the ij's to the same value t' > t > 2n^a/e^. The homogenization property 
can be used to obtain that the probability of the event maxj{Xa.} > t' scales linearly in t' . 



Pr 



max{Xa.} > t' 



<J2^T[Xa,>t'] 











< 






1 


< 






t' ' 



2a 



•Pr 



t-Pr 



max{Xa.} > t 

i 

max{Xa.} > t 



7e (2a ^ 

H Pr 

t'n \ e 



max{Xa.} > 



2a 
e 



7e 2a ^ 
+ — • — • Pr 

n \ e 



max{XaJ > 



2a 



This follows easily from Markov's inequality, if the expression in the brackets is within a constant 
factor of E[maxi{Xaj}]. The result is surprising as it is totally possible for that expression to 
be much smaller than E[maxj{Xa.}]. 

In the same spirit as the second point above, the theorem has many interesting implications by 
setting our tj's to different values. We make heavy use of the theorem in the following sections. 



G.3 Proof of Theorem 13 (the Reduction from Regular to po/?/(n)-Balanced Dis- 
tributions) 

G.3.1 Restricting the Prices for the Input Regular Distributions 

Lemma 38. Let V = {fi}ie[n] be a collection of independent regular value distributions, e G (0, 1), 
and c the absolute constant in the statement of Theorem\14\ For any price vector P, we can construct 



a new price vector P G [ea/n , 2n a/e such that IZp > TZp 



(c+lo)£7^opT 



where TZp and TZp are 



respectively the expected revenues under price vectors P and P, and TZqpt is the optimal expected 
revenue for V. 

Proof. First step: We first construct a price vector P' G [0, 2n'^a/e^]" based on P, such that the 
revenue under P' is at most an additive 0{ '^'^°^'^ ) smaller than the revenue under P. 

We define P' as follows. Let S = {i\pi > 2n'^a/e^}. For any i G S* set 
i ^ S set p\ = Pi. Now assume |5| = m. For notational convenience we assume that S = {a 



and set X^. 



Moreover, let t 



2v?a 



^J-II^^ while if 
i G [m]}. 



and ti = pai 



Clearly, the contribution to TZp from items in S is upper bounded by YlT^i^i^^i-^ai > ti]. We 
proceed to analyze the contribution to revenue TZ'p from items in S. Notice that, when maxi^s{vi} = 
maxjjXa.} > t, the largest value-minus-price gap for items in S is at least 2a /e (given our subtle 
choice of prices for items in 5 above). Hence, for the item of S achieving this gap not to be the 
winner, it must be that some item in [n] \ S has a larger value-minus-price gap. For this to happen, 
the value for this item has to be higher than 2a/ e. However, the probability that there exists an 
item in [n] \ S with value greater than 2a /e is smaller than n ■ ejr? = ejr? (by Theorem 14). Thus, 
when maxj{X(j.} > then with probability at least 1 — e/n^, the item in S achieving the largest 
value-minus-price gap is the item bought by the buyer. So when the price vector is P', the revenue 
from the items in S is lower bounded by (t — 2a/e) Pr[maxj{Xa.} > — e/n^) (where we used 
independence and the fact that Pi = t — 2a/ e for all i G S.) 

Clealry, (t — 2a/e) Pr[max{Xa.} > t] < t Pr[maxj{X(j.} > t] < TZqpt- To see this, notice that the 
first inequality is obvious and the second follows from the observation that we could set the prices of 
all items in 5 to t and of all other items to -|-oo to achieve revenue t Pr[maxj{Xa.} > t]. So TZqpt 
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should be larger than this revenue. Similarly, we see that 2a/e • Pr[maxj X^. > 2a/e] < TZqpt- Using 



these observations and Theorem 14 we get 



(t - 2a/ e) Pr[max{X,J > - e/n^) + ' ^^"^"^ 

7e • (2a/e • Pr[maxi{X„.} > 2a/e]) 



>(t - 2a/e) Pr[max{XaJ > t] + 

i 

m 

>Y^tiVT[Xa, >ti]. 



n 



2=1 



The above imply that the contribution to TZpi from the items in S is at most an additive '^^''^opt 
smaller than the contribution to TZp from the items in S. 

We proceed to compare the contributions from the items in [n]\S to TZp and TZp' . We start with 
TZp. The contribution from the items in [n] \ S is no greater than the total revenue when we ignore 
the existence of the items in S (e.g. by setting the prices of these items to +oo), since this only boosts 
the winning probabilities of each item in [n] \ S. 



Under price vector P' , Mi G S, Pr[f j > p^] < ^ (Theorem 14). So with probability at least 1 — 
no item in S has a positive value- minus-price gap and the item that has the largest positive gap among 
the items in [n\ — S" is the item that is bought by the buyer. Hence, by independence the contribution 
to TZ pi from the items in [n] — 5 is at least a 1 — % fraction of the revenue when the items of 5* are 
ignored. 

By the above discussion, the contribution to TZpi from the items in [n] — S is at most an additive 
^^^Z*^ smaller than the contribution to TZp from the items in [n\ — S. 
Putting everything together, we get that TZpi >TZp — iM^£2:_ 

Second step: To truncate the lower prices, we invoke Lemma [29} This implies that we can set 



all the prices below ea/n to ea/n , only hurting our revenue by an additive ea/n < — • maxz{z 



Pr[maxj{Xj} > z]) < ceTZoPT/n (where we used Theorem 14 for the first inequality). 



Hence, we can define P as follows: if p\ < ea/n^, set pi = ea/n^, otherwise set pi = p'-. It follows 

^>^^_(c±ioM 

□ 



from the above that TZp > TZp - (^+io)^^ofr 



Thus, we have reduced the problem of finding a near-optimal price vector in [0, +co]" to the 
problem of finding a near-optimal price vector in the set [ea/n^, 2n^a/e^]". 

G.3.2 Truncating the Support of the Input Regular Distributions 

We show that we can truncate the support of the distributions if the price vectors are restricted. 
Namely 

Lemma 39. Given a collection of independent regular random variables V = {vi}i^[n] '^'^'^ '^'^2/ ^ ^ 
(0, 1), let us define a new collection of random variables V = {■Vj}ie[n] via the following coupling: for 
all i £ [n], set Vi = ^ if vi < set Vi = An'^a/e^, if Vi > 4n^a/e'^, and Vi = Vi otherwise. Also, 



let c be the absolute constant defined in Theorem 14 For any price vector P G [ea/n^, 2n^a/e^]"', 
\TZp{V) — TZp(y)\ < ^'^''^opriV) ^ yjfiQ^g TZp{V) and TZp{V) are respectively the revenues of the seller 
under price vector P when the values of the buyer are V and V. 

Proof. First, let us define another collection of mutually independent random variables V = {i'i}ie[n] 
via the following coupling: for all i G [n] set Vi = An'^a/e^ if Vi > 4n^a/e^, and set Vi = Vi otherwise. 



By Theorem 14, we know that for every i, Pr[t'j > 4n^a/e^] < Hence, the probability of the 



event that there exists an i such that Vi > An'^a/e^ is no greater than n x e" jlriJ = c" /2n^. Thus 

28 



the difference between the contributions of this event to the revenues TZp{V) and TZpiV) is no greater 
than 2n^a/e^ • (e^/2n^) = ^ < ^ • maxz{z ■ Pr[maxj{vi} > z]) < '^^'^opt ^ given that the largest price 
is at most 2n^a/e^. 

Now let us consider the event: Vi < An'^a/e^, for all i. In this case Vi = Vi for all i. So the 
contribution of this event to the revenues TZp{V) and TZp{V) is the same. 
Thus, |7^p(V) - 7^p(V)| < 



Now it follows from Lemma 35 that the seller's revenue under any price vector in [ea/n^, 2n'^a/e^]'^ 



is the same when the buyer's value distributions are V and V. □ 

The above lemma shows that we can reduce the problem of finding a near-optimal price vector 
in [ea/n^,2n?a/e^]'^ for the original value distributions V to the problem of finding a near-optimal 
price vector in the set [ea/n^, 2n'^a/e^]"' for a collection of value distributions V supported on the set 
[^j, 4n^a/e^]. Next, we establish that the latter problem can be reduced to finding any (i.e. not 
necessarily restricted) near-optimal price vector for the distributions V. 

Lemma 40. Given a collection of independent regular random variables V = {vi}i^[n] o^'^ '^'^2/ ^ S 
(0, 1), let us define a new collection of random variables V = {wj}ie[n] "i^io, the following coupling: for 
all i G [n], set Vi = ^ if Vi < set Vi = An'^a/e^ if vi > An^a/e^, and set Vi = Vi otherwise. 



Let also c be the absolute constant defined in Theorem I4 For any price vector P, we can efficiently 



4 0^2^, /,2]n .L„. -r, , ^ T> _ {c+W)e-HopTiV) 



construct a new price vector P € [ea/n , 2n a/e such that TZpiV) > TZp{V) 



n 



The proof is essentially the same as the proof of Lemma [38] and we skip it. Combining Lem- 



mas 38 , 39] and 40 we obtain Theorem 13 The proof is given in the next section. 



G.3.3 Finishing the Reduction 



Proof of Theorem We start with computing a. This can be done efficiently as specified in the 

Now let us define V via the following coupling: for all i G [n], set Vi = ^ 

Vi otherwise. It is not hard to see that the 



statement of Theorem 



14 



if Vi < set Vi = An^/e^ if Vi > 4n^a/e^, and set v- 



distributions of the Vi's can be computed in time polynomial in n, 1/e and the description complexity 
of the distributions of the Uj's, if these are given to us explicitly. If we have oracle access to these 
distributions, we can construct oracles for the distributions of the Vj's that run in time polynomial in 
n, 1/e and the desired oracle accuracy. 

Now let P be a price vector such that 7^p(V) > (1 - e + ^^^^^) ■ 11opt{V). It follows from 
Lemma 40 that we can efficiently construct a price vector P' G [ea/n^, 2n^Q/e^]", such that 



n 



n 



n 



38 



implies that there exists a price vector P G [ea/n , 2n a/e such that TZpiy) > 

we know that 



Lemma 

(1 - • TZoPTiV). By Lemma 



39 



TZoPTiV) > TZpiV) > UpiV) - -TZopHV) >(i- ^^^±M1 ) . 7^oPT(V). 



So TZp'{V) > (1 — e + ^) • 7^opt(V). We can now apply Lemma 39 again, and get 



ce 

^P'(V) > 7^pKV) - -IZoPTiV) > (1 - e) • 7^oPT(V). 
n 



□ 
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H From Continuous to Discrete Distributions: the Details 



We give the details of our discretization results, culminating in the proof of Theorem |15[ Throughout 
this section we assume that the values of the buyer lie in a known finite (but not necessarily 

discrete) interval [umin,Umax], where Umax = f ■ Umin for some r > 1. Our goal is to establish that 
the prices, as well as the values can be restricted to discrete sets. 



H.l Discretization in the Price Domain 

For starters, it is an easy observation that, when the value distributions are supported in the set 
[umin,Umax], it is Sufficient to consider prices that lie in the same range, without any sacrifice in the 
revenue. This is the point of the next lemma. 

Lemma 41 (Price Restriction). Suppose that the value distributions of the items are supported in 
[umin, Umax] CLud let P be any price vector. Suppose we modify the price vector P to a new price vector 
P' as follows: p[ = Umax, if Pi > Umax; p'i = Umin, if p'i < Umin; and p'^ = pi otherwise. The expected 
revenue TZp and IZpi achieved by the price vectors P and P' respectively satisfies TZpi >TZp. 



Proof of Lemma 4I' The proof follows easily by combining Lemmas 42 and 43 below 



Lemma 42. Let P be a price vector and let Sover = {i '■ Pi > Umax}- We define P' to be the following 
price vector: p'^ = Umax, 

for all i £ Sover, o,nd p\ = Pi otherwise. We have TZp' >TZp. 

Proof. Observe that, for any valuation vector (^1,^2, • . . ,Vn) where no item has value larger than or 
equal to its price in P, the revenue is 0. If that's the case, the revenue under P' can be no worse 
than 0. So we only consider valuation vectors where some item i is sold under P. Then it must be 
that Vi — Pi > and, for all j, Vi — Pi > vj — pj. Under price vector P', there are two possibilities: 
(1) i is still the item sold; in this case the revenue is the same under P and P'; and (2) i is not the 
winner anymore; the new winner must be among those items whose price was decreased going from 
P to P', i.e. some item in j G Sover- Observe that p'j = Umax > Pi (since for i to be a winner under 
P, Pi must be no greater than Umax, as otherwise Vi — pi < 0) . Hence, under P' an item j that is at 
least as expensive as i is sold. Given that the above is true point-wise, TZp' > TZp. □ 

Lemma 43. Let P be a price vector and let Sbeiow = {i '■ Pi < Umin}- We define P' to be the following 
price vector: p'- = Umin, for all i G Sbeiow, and p'- = pi otherwise. We have IZpi > TZp. 

Proof. For any valuation vector {vi,V2,. . ■ ,Vn) where no item has value larger than or equal to its 
price in P, the revenue is 0. If that's the case, the revenue under P' can be no worse than 0. So we 
only consider valuation vectors where some item i is sold under P. Suppose first that i ^ Sbeiow Then 
i is still the winner under price vector P' , since we only increased the prices of items different than i 
going from P to P' . If the winner i G Sbeiow, there are two possibilities: (1) i is still the winner; then 
the price paid is higher under P', since = Umin > Pi- (2) i is not the winner anymore; nevertheless, 
there still needs to be a winner since Vi > Umin = p'i, and the price paid is at least Umin > Pi- Given 
that the above hold point-wise, IZpi >TZp. □ 

□ 



Combining Lemma 41 with a price discretization lemma attributed to Nisan ^5j, allows us to 
discretize the set of prices to a set of cardinality 0(^^^), as follows. 

Lemma 44 (Price Discretization). Suppose that the value distributions of the items are supported in 
[umin,Umax]- For any e G (0, 1/2), consider the following finite set of prices: 



Ve = {p 



_ l + e^-e 

P — ' '"mm, « G 
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log 1., {u max 

/U 



For any price vector P G [umin,Umax]^ , we can construct a price vector P' such that p'^ G and 
p[ G [1 — e, 1 + — e] • pi, for all i. The expected revenue achieved by the two price vectors satisfies 
Hp' > (1 - 2e)llp. 

Proof of Lemma \44\ Our proof exploits the following Lemma, due to Nisan. 

Lemma 45 (Nisan). For any e G (0, 1), let P and P' be price vectors that satisfy p[ G [1— e, l+e'^—e]-pi, 
for all i. Then the expected revenue achieved by the two price vectors satisfies TZp' > (1 — 2e)TZp. 



Coming to the proof of Lemma 44 , for every pi define 

2 



, _ 1 + - e 

Pi — I I ■ '^min' 



(1 - e2) 

Observe that 



1 r 2 1 

• Umin G [1 - e ,1\-Pi- 



(1 _ g2^Llogi/(i_,2){Pi/«min)J 

On the other hand, (1 - e2)(l + - e) = 1 - e + - > 1 - e. Thus, p ■ G [1 - e, 1 + - e] • pi, for 



all i. Now Lemma 45 implies that TZp' > (1 — 2e)Rp. □ 



H.2 "Horizontal" Discretizations in the Value Domain 

To enable discretizations in the value domain, we prove an analogue of Nisan's lemma for the value 
distributions. A straightforward modification of Nisan's approach would result in a discrete support 
of the value distributions of size linear in r^logr, where r = Umax/umin- With a more intricate 
argument, we obtain a reduction to a support of size linear in logr. Our discretization result is 
summarized in Lemma [48] and is obtained via an application of Lemma [46) 



Lemma 46. Let and {t'tjieH ^^'^ collections of mutually independent random variables, 

where all Vi's are supported on a common set [umin^Uraax] C M-|-, and let r = Umax/umin- Let also 
6 G ^0) (4|-iog.r-|)i/(2a-i) ) where a G (1/2,1), and suppose that we can couple the two collections of 
random variables so that, for all i € [n], Vi G [1 + 6 — 6'^ , 1 + 6] ■ Vi with probability 1. Finally, let 
TZopT be the optimal expected revenue from any pricing when the buyer's values are {wj}ig[n]- Then, 
for any price vector P G [umin,Umax]^ , such that Tlp{{vi}i) > TZopt/2, it holds that 

np{{vi}i) > (1 - 36'~'')np{{vi}i), 

where TZp{{vi}i) is the expected revenue of a seller using the price vector P when the values of the 
buyer are {filigfni, while TZp{{vi}i) is the revenue under P when the buyer's values are {^'iliefnl- 



Proof of Lemma\4-(^ For notational convenience, throughout this proof we use TZp := 7^p({t'i}i) and 
Up := TZp{{vi}i). 

Consider now the joint distribution of {uj}iG[n] and {vi}i(z[n] satisfying Vi ^ [I + 6 — 6'^ ,1 + 5] ■ Vi, 
for all i, with probability 1. For every point in the support of the joint distribution, we show that the 
revenue of the seller under price vector P is approximately equal in "Scenario A", where the values 
of the buyer are {■Vj}ig[n]> and in "Scenario B", where the values are {?)t}iG[n]- In particular, we argue 
first that the winning prices in the two scenarios are within S ■ Umax from each other with probability 
1. Indeed, for every point in the support of the joint distribution, we distinguish two cases: 

1. The items sold are the same in the two scenarios. In this case, the winning prices are also the 
same. 
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2. The items sold are different in the two scenarios. In this case, we show that the winning prices 
are close. Since Vi is greater than Vi for all i, if there is a winner in Scenario A, there is a winner 
in Scenario B. Let i be the winner in Scenario A, and j be the winner in Scenario B. We have 
the following two inequalities: 

Vi-Pi> Vj - Pj 
Vj - Pj > Vi - Pi 

The two inequalities imply that 

Vj - Vj >Vi- Vi. 

Since Vj S [1 + 5 — 5'^ , I + 5] ■ Vj , it follows that Vj — Vj<6- Vj. Using the same starting condition 
for i, we can show that Vi — Vi > {5 — S'^) ■ Vi. 

Hence, 

6-Vj> {6-6^)-Vi. 

Also we know that 

Pj >Pi+ Vj - Vi. 

Therefore, 

Pj >Pi + Vj - Vi > Pi + {1 - 6) ■ Vi - Vi = Pi - 5 ■ Vi. 

The above establishes that with probability 1 the winning prices in the two scenarios are within 
an additive Sumax from each other. We proceed to convert this additive approximation guarantee into 
a multiplicative approximation guarantee. Observe that whenever pi > 5"'Vi, pi — 5 -Vi > (1 — 6^~"')pi. 
Hence, if we can show that most of the revenue TZp is contributed by value-price pairs {vi,pi) satisfying 
Pi > 5°'Vi, we can convert our additive approximation to a (1 — 5^~°') multiplicative approximation. 
Indeed, we argue next that when a price vector P satisfies TZp > IZopt/'^, the contribution to the 
revenue from the event 

S = {the sold item k satisfies pk < S"'Vk} 

is small. More precisely. 

Proposition 47. If TZp > TZopt/'^, then the contribution to TZp from the event S is no greater than 
IS^-^TZp. 

Proof. The proof is by contradiction. For all i G [[logr]], define the event 

Si = {{the sold item k has price pk < S"'Vk) A {pk € [2^~^Umin, 2*Umm))}- 

Note that Si and Sj are disjoint for all i j. Let Up = [logr] and note that S = u"^j^5i. Assuming 
that the contribution to TZp from the event S is larger than 26^~"'TZp, there must exist some i such 
that the contribution to TZp from Si is at least 26^~"'TZp/np > 6^^"'TZoPT/n-p- For this i, let us modify 
the price vector P to P' in the following fashion: 

/ J ~l~00 Pfc ^ [2' Uminj^ UfYiin) 

Pk 1 2'-^^M^i„ otherwise 

We claim that for all outcomes (fi,f2,. . . ,fn) £ Si, there always exists an item sold under P' . 
Indeed, let k be the winner under P. Then pk < 5'^Vk. By the definition of p^, we know that 

2*~^i; ■ 

/ "■mm ^ I s:a ^ 
Pk = - Pkl^ < 

^To be more accurate, replace the set 2'iimi„) by [2'~^?imi„, for the definition of the event Snp. 
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Thus, an item has to be sold. Moreover, the sold item has price ^ — g"*"™ ; as all the other prices are 
set to +00. Hence, we can lower bound TZpi as follows 

-n ^ T. roi 2*"^Mmm ^ Contribution of Si to Up 5^^°-TioPT 

Tcpi > Fr lb,- • > > . 

P - ^ Sa - 26" - 2np5« 

Given that 5 < (4^)^^^^""^^ the above implies IZpi > 2TZopt, which is impossible, i.e. we get a 
contradiction. This concludes the proof of the proposition. □ 

Given the proposition, at least (1 — 2(5^~") fraction of TZp is contributed by value-price pairs {vi,pi) 
satisfying Pi > 6"'Vi. Recalling our earlier discussion, this implies that T^P > {l—26^~"'){l—6^~"')Tlp > 
(1 - 35i-")7lp. □ 



Lemma 46 allows us to discretize the support of the value distributions into a discrete set of 



bounded cardinality without harming the revenue, as specified by the following lemma. 

Lemma 48 (Horizontal Discretization of Values). Let be a collection of mutually independent 

random variables supported on a set [umin, Umax] C M+, and let r = . For any 5 G ( 0, 



(4riogr])4/3 

there exists another collection of mutually independent random variables {vi}i,^[n], which are supported 
on a discrete set of cardinality O ^^^^^ md satisfy the following properties. 

1. The optimal revenue when the buyer's values are {vi}ii^[n] is at least a (1 — 2>5^^^) -fraction 
of the optimal revenue when the values are {fi}jg[n]- I-^- T^opt > (1 — 'i^^^^)T^OPT, where 
T^OPT = maxp 7^p({uj}i) and IZopt = niaxp T^pdwiji). 

2. Moreover, for any constant p G (0,1/2) and any price vector P such that TZp{{vi}i) > (1 — 
p)Ti-OPT, we can construct in time polynomial in the description of P, 1/6 andlogUmin another 
price vector P such that TZp{{vi}i) > (1 — 76^^^ — p)TZoPT- 

Moreover, assuming that the set [umin,Umax] is specified in the inputl^ we can compute the support 
of the distributions of the variables {vi}i in time polynomial in logUmin, log Umax and 1/6. We can 
also compute the distributions of the variables in time polynomial in the size of the input and 

1/e, if we have the distributions of the variables {vi}i^[n] explicitly. If we have oracle access to the 
distributions of the variables we can construct an oracle for the distributions of the variables 

{vi}i^[n], running in time polynomial in logUmin, logiimax; ^/S, the input to the oracle and the desired 
precision. 



set 



Proof of Lemma J^8: We begin with the description of the random variables {wj}jg[n]- We will use 
{Fi\i<=\n\ and {Fi\i<^\n\ to denote respectively the cumulative distribution functions of the variables 
{^i}ie[n] and Taking = ^j^^_^2 , our new variables {?)i}ie[ra] '^i^^ supported on the 

la, = (1 + 5)(1 + O'Urmn J G {o, . . . , [logl+g 1 1 . 

Moreover, for all i, the probability mass that Fi assigns to every point in its support is: 

Fiiaj) = Fi{aj/{1 + 6 - 6^)) - Fi{a,/{1 + 6)). 

Now, for all i, we couple Vi with Vi as follows: If G [oj / {l + 6),aj / {1 + 6 — 6"^)), we set Vi = aj. Given 
our definition of the i^j's, this defines a valid coupling of the collections V = {vi}i and V = 

''The requirement that the set [umin,Umax] is specified as part of the input is only relevant if we have oracle access 
to the distributions of the Vi's, as if we have them explicitly we can easily find [umin,Umax]. 
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Moreover, by definition, our coupling satisfies t)j G [1 + 6 — 5'^ ,1 + 6] ■ Vi, for all i, with probability 1, 
and all the -Oj's are supported on [(1 + 5)umin, (1 + S)umax]- 



We are now ready to establish the first part of the lemma. Using Lemma 46 and the property of 
our coupling it follows immediately that 

7^p(V) > (1 - 35^/^)7^p(V), 



for any price vector P G [umin,Umax]^ s.t. Tlp{V) > \1Zopt- Lemma 41 implies that the optimal 
revenue for V is achieved by some price vector in [umin,Umax]^- Hence, we get from the above that 

noPT > (1 - 3^l/8)7^oPT. 

We proceed to show the second part of the lemma. We do this by defining another collection 
of random variables V = {vi}i^[n]- These are defined implicitly via the following coupling between 
{vijiein] and {vi}i(z\n]- for all i, we set 



Vi 



as follows: 



It follows that the Vj's are supported on [umin/ {I + 6 — 5"^), Umax/ (1 + (5 — (5^ 
Moreover, for any price vector P, let us construct another price vector P 

Under our coupling between {^i}ig[ri] and {vi}i^[n], it is not hard to see that if we use price vector 
P when the buyer's values are {'Oi}ig[n] and price vector P when the buyer's values are {vi}i(z[n], then 
the index of the item that the buyer buys is the same in the two cases, with probability 1. Hence: 



This follows from the fact that both P and {^i}i6[n] are the same linear transformations of P and 
{vi}i(^[n] respectively. 

Composing the coupling between Vi and Vi and the coupling between Vi with Vi, we obtain a 
coupling between Vi and Vi. We show that this coupling satisfies Vi G [1 + 6 — 6'^ ,1 + 6] ■ Vi, with 
probability 1. Since (1 + 6- 6^)vi <Vi<{l + 5)vi, 

Vi/{1 + 5)< Vi/{1 + 5-5^)il + d) = v,< Vi/{1 + 5- 5^). 

G [1 + 5 - (5^ 1 + <5] • iJi. 



Given that Vi G [1 + 5 — 5 ,1 + 5] ■ Vi with probability 1, an application of Lemma 46 implies that, 
for any price vector P G [umin/ (1 + 5 - 5'^), Umax/ (l + S - 5^)]" satisfying TZp{V) > ^T^opriV): 



np{V) > (1 - 3(5^/S)7^p(V). 



(9) 



Now let P be a price vector satisfying TZp{V) > (1 — p)T^oPT- Lemma 41 implies that WLOG 
we can assume that P G [(1 + S)umini (1 + S)umax]^ (as if the given price vector is not in this set, 
we can efficiently convert it into one that is in this set without losing any revenue.) Then the vector 
P obtained from P via Eq. ([T]) is in [umm/(l + <5 — S'^),Umax/{^ + S — 5^)]", and clearly satisfies 
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T^p(y) > (1 — p)T^optO^), as P and V are the same linear transformations of P and V respectively. 
Hence, Equations ^ and ^ give 



^p(V) > ((1 - 35i/«)/(l + + 6- d^)jnp{V) 

> (l-3(5^/S)(l-25)7lp(V) 

> (1 - A6^/^)np{V) 

> (l-45i/8)(l-p)^oPT 

> (1 - 4(5^/^)(l - p)(l - 3(5^/^)7^oPT (using the first part of the theorem) 

> (1 - 751/8 - p)noPT- 



□ 



H.3 Proof of Theorem 



Proof of Theorem 15: Lemma 48 implies that we can reduce the problem Price(V, e) to the problem 
Price(V, (e/8)8), where V = {vi}i is a collection of mutually independent random variables supported 
on a common discrete set 5 = {s^^) . . ,s('=i)} C [(1 + {e/8f)umin, (1 + (e/Sf) 

Umax] of cardinality 



ki = 0(-^^). Now, Lemmas 41 and 44 imply that we can reduce the problem Price(V, (e/8) ) to the 
problem ResrtictedPrice(V, V, 0.5(e/8)8), where P is a discrete set of prices of cardinality 0{^^f). 
We omit the analysis of the running time of the reduction, as it is straightforward. □ 

H.4 "Vertical" Discretizations in the Value Domain 



Lemma 48 allows us to reduce our problem into one where the support of the value distributions is 



a bounded set of cardinality 0(^^^), where r = Umax/umin without worsening the revenue by more 
than a fraction of 6^^^. The following lemma allows us to also round the probabilities assigned by 
the value distributions to every point in their support. But, this result is weaker as we require the 
probabilities to scale polynomially in both the number of distributions n and r. Still, the lemma is 
handy in the design of our algorithms as it helps keep our calculations in finite precision. 

Lemma 49 (Vertical Discretization of Values). Let {?^^}ie[n] ■^st of mutually independent random 
variables supported on a discrete set S = jsi, . . . , Sm} and let \umin: Umax] C M and r = . It is 
assumed that S as well as Umin, Umax o.'^s. specified as part of the input. Then we can construct another 
collection of mutually independent random variables {v'l}i^[n]j which are supported on the same set S 
and are such that the probability that v'l equals any value in S is an integer multiple of ^^^^ for all i. 
Moreover, for any price vector P G \umin)Umax 

]"■, if IZ'p is the seller's expected revenue under price 
vector P when the buyer's values are {f,'}j6[n]; md TZp is the revenue under price vector P when the 
buyer's values are {v'^}ie[n]! then 

I II Am 

\i^P ~ I^P] ^ 9 ■ Umin- 

rn'^ 

Moreover, the distributions of the variables v'l are constructed explicitly, and the running time of 
the construction is polynomial in n, m, logr, the description complexity of S, and the description 
complexity of the distributions of the v'^ 's, if these are given explicitly in the input. 



Proof of Lemma 49' If we know the distributions explicitly, then, for all i, we construct the distribution 



of v'l based on the distribution of v'^ as follows. Let tt'^. = Pr[u^ = sj], and 

S' = vr' - , • I vr' • (rn)^J . 
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Denoting Pr[u-' = sj] by tt'^. (for convenience), we set tt"^. := ir'g^. — 6'g., for j > 2, and vr"^ := 

It is not hard to see now that we can couple the two sets of random variables in the following way. 
For every i: if v'^ = si, we let v'-' = si; if v'^ = sj for j > 2, we let v'^ = Sj with probability 1 — 

and let = si with probability Notice that this coupling satisfies: 



j=2 j=2 ^ ^ 

Now taking a union bound over all i, the probability that the vector v' = {vi,V2, ■ ■ ■ , v'^) is different 
from v" = {v'l, ^2 , . . . , v") is at most ■ In other words, with probability at least 1 — -j^^-, v' = v" . 
Clearly, for all draws from the distribution such that v' = v" , the revenues are the same. When 
v' 7^ v" , the difference between the revenues is at most Umax, since P G [umin,Umax]"'- And this only 
happens with probability at most / "^.^ . Therefore, the difference between the expected revenues under 

yrn) 

the two distributions should be no greater than • Umax < ^ • Umin, i-e. {TZp — TZpl < ■ Umm- 
Clearly, we can compute the distributions of the f^"s in time polynomial in n, m, logr and the 
description complexity of the distributions of the variables f^'s, if these distributions are given to us 
explicitly. If we have oracle access to the distributions of the v'^s we can query our oracle with high 
enough precision, say l/(rn)^, to obtain a function gi : S ^ [0, 1] that satisfies X]xe5 ~ 1 ± ^^^p . 
Using Qi as a proxy for the distribution of v[ we can follow the algorithm outlined above to define the 
distribution of v'l . It is not hard to argue that the total variation distance between v[ and v'l can be 
bounded by ^^^3 . Hence, we can couple v[ and v'l so that 

or// //i ^ 4m 



{rnY 

and proceed as above. □ 



I The Algorithm for Discrete Distributions 

I.l The DP Step: Add an Item and Prune the Table 

In Section [rj we described what our intended meaning of the Boolean function (7(i,Pr). Here we 
discuss how to compute g via Dynamic Programming. Our dynamic program works bottom-up (i.e. 
from smaller to larger z's), filling in 5's table so that the following recursive conditions are met. 

• If i > 1, we set g{i, Pr) = 1 iff there is a price p'^^^ and a distribution Pr so that the following hold: 

1. g{i-l,VT) = 1. 

2. Suppose that under some pricing the (winning-value, winning-price) distribution for the prefix 
1 ... i — 1 of the items is Pr , and we assign price p^-'^ to the i-th item. We can compute the 
resulting (winning- value, winning-price) distribution {Prij,i2}iie[/ci], i2&[k2\ prefix 
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from just Prj^ the distribution Fj. Indeed: 



+ 



-/ 



Pr 

. l)0l)-p{J2) 

\ <t,{n)_p(i2) / 



Pr [vi = t;(*i)] • 1 



p(i)=p(»2) • 



ij, and / = f Eiie[fci], j2e[fc2] "^^i-'O / (^)' we require that the first / probabihties in 



We require that Pr is a rounded version of Pr , where ah the probabihties are integer multiples 

, - — -// 
of The rounding should be of the following canonical form. Setting Si-^^^i^ = Prj^ ~ 
-^// 

{Prj^ j2}iie[fci],[j2]eA:2 ™ some fixed lexicographic order are rounded up to the closest multiple of 
and the rest are rounded down to the closest multiple of 

(If Conditions 1 and 2 are met, we also keep a pointer from cell g(i,Pr) to cell — l,Pr ) of the 
dynamic program recording the price p^-'^) on that pointer. We use these pointers later to recover 
price vectors consistent with a certain distribution.) 

• To fill in the first slice of the table corresponding to i = 1, we use the same recursive definition 
given above, imagining that there is a slice i = 0, whose cells are all except for those corresponding 
to the distributions Pr that satisfy: Prj^^jj — 0) ^i; ^2, except for the lexicographically smallest 

{i*i,i*2) G argmin(fc^^fc2) u^^i) -p^'^a)^ where Pri*,i* = 1. 

While decribed the function g recursively above, we compute it iteratively from i = 1 through n. 
1.2 Lower Bounding the Revenue 

In this section, we justify the correctness of the algorithm presented in Section [7j providing a proof 



of Lemma 16 Intuitively, if we did not perform any rounding of distributions, our algorithm would 
have been exact, outputting the optimal price vector in {p^^\ . . . jp^'^^^}" . We show next that the 
rounding is fine enough that it does not become detrimental to our revenue. To show this, we use the 
probabilistic concepts of total variation distance and coupling of random variables. Recall that the 
total variation distance between two distributions P and O over a finite set A is defined as follows 



J j;|P(a)-Q(a)|. 



Similarly, if X and Y are two random variables ranging over a finite set, their total variation distance, 
denoted \ \X — Y\\tv is defined as the total variation distance between their distributions. 

Proceeding to the correctness of our algorithm, let P = {pi,P2, ■ ■ ■ ,Pn) G {p^^\ ■ ■ ■ jp^'^^^}^ be an 
arbitrary price vector. We can use this price vector to select n cells of our dynamic programming 
table, picking one cell per layer. The cells are those that the algorithm would have traversed if it 
made the decision of assigning price pi to item i. Let us call the resulting cells celli, celh, ■ ■ ■ , celln- 

For all i, we intend to compare the distributions /Pr-*''. \ and < Pr- ^ 

I *i'*2i*ie[fci], i2G[fc2] 1 

which are respectively the (winning- value, winning- price) distribution: 

• arising when the prefix 1 ... i of items with distributions {i^j}j=i,...,j is priced according to price 

vector (pi, . . . ,pj); 
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1*2 



'jie[fci], j2e[fc2] 



• stored in celli by the algorithm. 

The following lemma shows that these distributions have small total variation distance. 

Lemma 50. Let and be two collections of {kik2)- dimensional random unit vectors 

defined in terms of {Pr»} and <|pr'"''j> as follows: for all i G [n], ii G [ki], i2 € [^2], and 

£ = {ii — 1) ■ k2 + 12 ^ [kik2], we set Pr[Xj = ei] = P^i^^i^ and Pr[Xj = ei] = P^l^^i^, where ei is the 
unit vector along dimension I. 
Then, for all i, 

I \Xi — Xj| ^ nk\k2lrrC' . 

Proof. We prove this by induction. For the base case (i = 1), observe that ||Xi — XiH^^y < kik2/m^, 

because Pr is just a rounding of Pr^^^ into probabilities that are multiples of whereby the 
probability of every point in the support is not modified by more than an additive —3- . 
For the inductive step, it suffices to argue that for all i S [n — 1] , 

ll-^j+i — -^j+i||rv — ll-^j — -^illTV < kik2/m^. 

To show this, we are going to consider two auxiliary random variables l^+i and Zj+i: 

• 1^+1 is a (A;iA;2) -dimensional random unit vector defined as follows: for € [k2\ such that 

= Pi+i, if ^ = (ii-l)-fc2+i2, then Pr[yi+i = e<.] = otherwise Pr[yi+i = e^^ 

• Zj+i is a (/ciA;2)-dimensional random unit vector defined as follows 

Pr[Zi+i = e(i^_i).fc2+j2] = ^ Pr[Xi + Fj+i = ei^i^^iyk^j^i^ + e^^j^.iyk^j^j^], (10) 

h J2 
s.t. v'-h)-pU2) 

where for the purposes of the above definition Xi and l^+i are taken to be independent. |^ 
We claim that 

\\Xi+i - Zi+i\\Tv < ||(X, + y,+i) - (X, + K,+i)||ry. (11) 
Indeed, we can define Xj+i as follows: for all ii, i2, 

Pr[Xi+i = e(j^_i).fc2+j2] = ^ Pr[Xi + yi+i = e(jj_i).fc2+i2 +eyj_i).fc2+j2], (12) 

jl ,32 
s.t. i,(Jl)-p02) 
<l,(si)_p(»2) 



where for the purposes of the above definition Xi and l^+i are taken to be independent. Now (10), 
(12) and the definition of the total variation distance imply (11). 

To finish our proof, suppose further that we couple Xi and Xi optimally. By the optimal coupling 
theorem our joint distribution satisfies Pr[Xj 7^ Xj] = \ \Xi — Xi\\Tv- Defining l^+i in the same space 
as Xi and Xi so that Yi+i is independent from both Xi and Xi, the coupling lemma implies: 

\\{Xi + Yi+i) - (X + Y^+i)\\tv < Pr[(X + Yi+i) / (Xi + Yi+i)] 

< FT[Xi / X,] 

= \ \Xi — XiWxv (since Xi and Xi are optimally coupled) 
On the other hand, it is easy to see that ||^j+i — Xj+illry ^ kik2/m^. Indeed: 



In other words, Fi+i is the random unit vector analog of Fi+i. 
^ Zi+\ is the random unit vector analog of the (winning-value,winning-price) distribution for the prefix 1 . . . i + 1, if 
item i + 1 is assigned price Pi+\ and the (winning- value,winning-price) distribution for the prefix 1 . . .i is Pr^. 
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• is the random unit vector analog of the (winning-value, winning-price) distribution for the 
prefix 1 . . . z + 1 of items, if item i + 1 is assigned price Pi+i and the (winning- value, winning-price) 
distribution for the prefix 1 ... z is Prj ; 

• Xj+i is the random unit vector analog of the same (winning-value, winning-price) distribution 
as above, except after rounding that distribution according to the rounding rule used in our 
dynamic program; 

• the rounding changes every probability in the support by at most an additive 
Combining the above and using the triangle inequality, we obtain 

ll-'^j+i — -^i+iIItv < ll^j+1 — -^i+illry + ll-'^i+i - ^i+illry < kik2/m'^ + \ \Xi — Xi\\Tv ■ 

□ 



Proof of Lemma 16: Let P* be the optimal price vector for the instance of the pricing problem obtained 



after the application of Lemma 49 , and let cell* be the cell at layer n of the DP corresponding to the 



price vector P* . Lemma 50 implies that 

|Pr,H -Frgj<nfcifc2V, 
where Pr^"^ is the true (winning- value, winning-price) distribution corresponding to price vector P* 

(n) 

and Pr is the distribution stored in cell cell* . Clearly, the expected revenues IZp* and TZceii* of 
these two distributions are related, as follows 

|7^P. -7^ee^p| < 2^ \Fvll^^-PTi^^iJ-p^^> <-^-max{V} <—^-mm{V}. 

jie[fci], j2e[fc2] 

Now let cell' be the cell at layer n of the dynamic programming table that has the highest revenue, 
and let P' be the price vector reconstructed from cell' by tracing back-pointers. Using the same 
notation as above, call TZceW the revenue from the distribution stored at cell' and TZp' the revenue 
from price vector P' . Then we have the following: 

T^ceW > T^ceii*; (by the optimality of cell') (13) 

kik2 



\TZpi — TZceii'l < — 5- • mm{V}. (using Lemma 50, as we did above) (14) 



Putting all the above together, we obtain that 

7^p/ > TZp. - • mm{V}. (15) 

Hence, the price vector P' output by the dynamic program achieves revenue TZp' that is close to 
the optimal in the instance of the pricing problem obtained after the first step of our algorithm, i.e. 



the instance obtained after invoking the reduction of Lemma 49 , We still have to relate the revenue 
that P' achieves in the original instance (i.e. before the reduction of Lemma 49) to the optimal 
revenue OPT of that instance. For this, let us define the quantities: 

• TZ{P*): the revenue achieved by price vector P* in the original instance; 

• 1Z{P'): the revenue achieved by price vector P' in the original instance. 
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Using Lemma 49 we can easily see that 



7^(P*) > OPT 



8m 



min{7^}. 



Moreover, 



n{P') > IZp, 



Am 



minj^}; and 



Up* > n{p*) 



Am 



minlP}. 



Combining these with (15), we get 



„ 2k\ko Am 
niP') > Tip* V • min{P} ^ • miniP} 

> TZiP ) 7^ ■ mmjP} ^ • mm{P} 

m^ rw^ 



> OPT 



2kik2 + 16mr 



m^ 



min{7^}. 



(16) 
(17) 
(18) 



□ 



1.3 Analyzing the Running Time 



Proof of Lemma 11: Recall that both the support S = {v 
.(1) 



(1) „(2) 



,(fcl) 



,P 



} of the value distributions 



} of prices are explicitly part of the input to our algorithm. 



and the set V := | 

Given this, the reduction of Lemma 49 (used as the first step of our algorithm) takes time poly- 
nomial in the size of the input. After this reduction is carried out, the value distributions {Fi}i that 
are provided as input to the dynamic program are known explicitly and the probabilities they assign 
to every point in S are integer multiples of where m = nr, r = maxp^-'^/p^*). 

We proceed to bound the run-time of the Dynamic Program. First, it is easy to see that its table 
has at most n x (m^ + l)*'i'^2 QgHs, since there are n possible choices for i and m^ + 1 possible values 
for each Prj^^jj. Our DP computation proceeds iteratively from layer i = 1 to layer i = n of the table. 
For every cell of layer i, there are at most k2 different prices we can assign to the next item z + 1, and 
for every such price we need to compute a distribution and round that distribution. Hence, the total 
work we need to do per cell of layer i is polynomial in the input size, since our computation involves 
probabilities that are integer multiples of ^ . Indeed the probability distributions maintained by our 
dynamic program use probabilities that are integer multiples of and recall the distributions Fi 
also use probabilities in multiples of Hence, the total time we need to spend to fill up the whole 
table is rnP'^^^^'^\ In the last phase of the algorithm, we exhaustively search for the cell of layer n 
with the highest expected revenue. This costs time polynomial in the size of the input and rrP'^^'^^^\ 
since there are mP^^^^'^^ cells at layer n, and the expected revenue computation for each cell can be 
done in time polynomial in the input size. Once we find the cell with the highest expected revenue, 
we can obtain a corresponding price vector in time 0{n), since we just need to follow back-pointers 
from layer n to layer 1. 

Overall, the running time of the algorithm is polynomial in the size of the input and rnP'^^'^^'^\ □ 



J Summary of Algorithmic Results 

In this section, we prove our main algorithmic results (Theorems [T| [2| [sj |4| and [s]) . We only mildly 
try to optimize the constants in our running times. We should be able to improve them with a more 
careful analysis. We start with the proof of Theorem [Sj which we repeat here. 
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Theorem 51 (Finite Support). Let {i^i}igr„] he a collection of distributions that are supported on 



a common set [umin,Umax] C M+ which is specified as part of the input, and let r := Umax/umin- 
Then, for any constant e > 0, there is an algorithm that runs in time polynomial in the size of the 
input and n^°^ '^^'^ and computes a price vector P such that 

Up > (1 -e)OPr, 

where TZp is the expected revenue under price vector P when the buyer's values for the items are 
independent draws from the distributions {Fi}i and OPT is the optimal revenue. 



Proof of Theorem 
expected revenue (T 



51: First set e = min |e, (4|-|og^^-|)i/6 1- Clearly, it suffices to find a price vector with 
- e)OPT. Now, let us invoke the reduction of Theorem |15[ reducing this task to 
solving the discrete restricted-price problem RESTRiCTEDPRiCE({Fi}j, 7^, 0.5(e/8)^), where the value 
distributions {Fi}i are supported on a discrete set S = {s*-^^, . . . , s^'^i)} of cardinality ki = 0(logr/e^^) 
and the prices are also restricted to a discrete set V = {p'-^^ . . . ,p'^^^^} of cardinality k2 = 0(log r/e^^). 
It is important to note that 5 C [(1 + {e/8)^)umin, (1 + {(^ / 8)^)umax] and minj{p(*)} < minjjs^*)} (this 



can be checked by a careful study of the proof of Theorem 15). Hence, if OPT is the optimal revenue 
of the discrete instance resulting from the reduction, we have OPT > miuj It is our goal to 

achieve revenue at least (1 — 0.5(e/8)^)OPT in this instance. 

To do that, we invoke the algorithm of Section[7]or a modified version of it, depending on the value 
of n . In particular, if n > for a large enough constant c, we use the algorithm as is, obtaining a 
price vector with revenue at least 

°- - - ") ■ >- °- - - ")) 

> OPT {1 - 0{e^)) , (20) 

log^ r 

as we wanted. The running time of the algorithm in this case is polynomial in the input and (nr) ^ 

that is polynomial in the input and n~e^ (assuming n > 2). If n < we modify the input, 
introducing dummy items so that the total number of items for sale is n' = The dummy items 
have distributions that place a point mass of 1 at value min{s^*^}/2 (so we extend the set of values 
{s^^\ . . . , s^^^^} by one point s^") := min{s(*'*}/2), and hence they do not contribute anything to the 
revenue, since the minimum price satisfies minlp*^*-*} > min{s^*'*}/2 (this is easy to check from the 
proof of Lemma 44 ) Invoking the algorithm of Section [T] on the augmented instance, we obtain a 



price vector with revenue 

> OPT {1 - 0{e^)) , (22) 
as we wanted. The running time of the algorithm in this case is polynomial in the input and (n'r) , 

log^ rlog(l/j) 

that is polynomial in the input and n (assuming n > 2). Being a bit more careful in the 

application of our discretization lemmas we can obtain a running time of n^°s^^A'. □ 

Theorems [T] and [2] now follow immediately from Theorem |3] (Theorem 51 in this section) and our 
structural theorems for Monotone-Hazard-Rate and Regular distributions (Theorems 11 and 13 of 



Sections [3] and [4] respectively). Here are their statements with some constant optimizations. 

^°The requirement that the set [uminjUmax] is specified as part of the input is only relevant if we have oracle access 
to the distributions of the Vi's, as if we have them explicitly we can easily find [umin,Umax]- 
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Corollary 52 (PTAS for MHR Distributions). Suppose we are given a collection of distributions 
{Fi}i^[n] that are MHR. Then, for any constant e > 0, there exists an algorithm that runs in time 
o( ^ ) 

n and computes a price vector P such that 

np>{i- e)noPT, 

where TZp is the expected revenue under price vector P when the buyer's values for the items are 
independent draws from the distributions {-Fjj andlZoPT is the revenue achieved by the optimal price 
vector. 

Corollary 53 (Quasi-PTAS for Regular Distributions). Suppose we are given a collection of distri- 
butions {i^j}ie[n] that are regular. Then, for any constant e > 0, there exists an algorithm that runs 

in time n ^ ^ ' and computes a price vector P such that 

np>{i- e)noPT, 

where TZp is the expected revenue under price vector P when the buyer's values for the items are 
independent draws from the distributions {F}i andlZoPT is the revenue achieved by the optimal price 
vector. 

We proceed to give the proof of Theorem |4] 

Proof of TheoremlA Denote by V = {"Uijigin] the buyer's values for the items, and let OPT be the 
optimal revenue. Also, let e' = e/3. 



By Lemma 41, we only need to consider price vectors in [0,1]". On the other hand, Lemma 29 
tells us that if we restrict the prices to be higher than e', we lose at most an additive e' in revenue. 
So there exists a price vector P £ [e', 1]", such that TZpiV) > OPT — e' 

Now, we define a new collection of random variables V = {^'i}ie[n] via the following coupling: for 
all z G [n], set ■Uj = |- if tij < e', and Vi = vi otherwise. According to Lemma 35, for any price vector 
Pin [e',lf,TZp{V)=np{V). 
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and 



29 



we 



If we are given a price vector P, such that TZpiy) > (1 — e')TZoPT{y)- By Lemma 
can efficiently convert P to P' G [e', l]*", such that 7^p/(V) > 7^p(V) - e'. 
Combining the inequalities above, we have 

^opt(V) > 7^p(V) = 7^p(V) > OPT - e', 

and 

7^p,(V) = 7^P'(V) > TZpiV) - e' > (l - e')7^oPT(V) - e. 

Thus, 

np>{V) > (1 - e)OPT - 2e > OPT - 3e'. 
So the algorithm is as follows. We first construct V = {^i}igfnl! which can be done in time 



polynomial in n and 1/e. Next, we run the PTAS of Theorem 51 on [|-, 1], which runs in time n^^^'^ 



to get P. Finally, we convert P to P' in time polynomial in n and 1/e. □ 
We conclude this section with the proof of Theorem [8| 

Proof of Theorem We sketch the argument for MHR distributions. The argument for regular 
distributions is similar. Suppose we have an a-approximation to the optimal revenue for an instance 
of the pricing problem in which the values are independent and come from MHR distributions. That 
is, suppose we have /?' such that a/3' > OPT. We argue that f3' can play essentially the same role 
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as /3 in Theorem 11, More precisely, we show that we can reduce the problem of finding a near- 
optimal price vector for the original instance to finding a near-optimal price vector for a collection of 
distributions that are supported on a common interval centered around /?'. Hence, we can leverage 
our knowledge of^' to proceed with our algorithm, obviating the need to compute /3. 



From Lemma 



27 



we know that OPT > (1 



Hence, > /3. Now set c 



2a 



Theorem 12 it follows that for any e G (0, 1/4) 



Con 



maxjvi} > 2 log 



<36elog( ^ ) (c/3'). 



1- 



From 



Given this property and that /3' < OPT, we can reproduce the proof of Theorem 11 to argue that the 
resulting distributions can be truncated to the interval [e/3', 2clog(^)/3'], without losing more than a 
0(€)-fraction of the optimal revenue. □ 



K A Single Price Suffices for I.I.D. MHR Distributions 

We provide a faster algorithm for the case where the buyer's values are independent and identically 
distributed according to some MHR distribution. The main technical idea that goes into the algorithm 
is this: if the number of items is a sufficiently large function of e, then using a single price suffices to 
get an e fraction of the optimal revenue. We proceed to the details of our algorithm. To simplify our 
notation, let us assume that all the Uj's are independent copies of the random variable v, and denote 



the distribution of v by F. Moreover, let a„ = inf \^x\F{x) > 1 — ^} (as in Definition 18). We start 
by showing an analogue of Lemma [26) 

Lemma 54. If S = Con[v > (1 + e)an], then S < ^ill^l^ 



Using Lemma 54 and Lemma [32| we deduce that if we constrain our prices to be < (1 + e)an; 
we lose no more most ^^^^J"" revenue. Given that the optimal revenue with the restriction that all 
prices be < (1 + e)a„ is at most (1 + e)a„, it follows that the optimal revenue without the restriction 
is at most (1 + e)a„ + = (l + e)(l + ^)a„. This is very close to an if n is a sufficiently large 

function of e. If that's the case, it suffices to find a price vector achieving revenue close to a„. 

Lemma 55. // we use the price vector P = ((1 — e)an, (1 — e)an, . . . , (1 — e)a„), we receive revenue 
at least (l — e^~^'^ — e) 

Notice that, when n > (l/e)^/'^, n'^ > 1/e. In this case, we have shown that OPT < (1 + e)(l + 



6e)ara ^ (1 + 8e)a„. On the other hand. Lemma 55, says that we can achieve revenue at least 
(1 — ^J77 — e)a„ using a single price. Since e^^^ > 1/e, this revenue is at least (1 — 2e)a„. Given that 
(1 -f- 8e)(l — lOe) < (1 — 2e), we have (1 — 2e)an > (1 — 10e)OPT. So if we set the price for every item 
to be (1 — e)a„, we achieve a revenue that is at least (1 — 10e)OPT. 

Theorem 56. If the values of the buyer are i.i.d. according to a MHR distribution, there is a PTAS 
for finding a price vector that achieves a (1 — e)- fraction of the optimal revenue. The algorithm 

, log(l/e) 

runs in time linear in log(-^^) and polynomial in 2 and the size of the input. Moreover, if 

n ^ there exists an efficiently computable price such that, if all items are priced at this 

price, the resulting revenue is at least (1 — e)OPT. 
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L Proofs Omitted from Section IK 



Lemma 



□ 



21 



Proof of Lemma 54' By Lemma 19, we know that (1 + e)an > Thus, S < Con[v > a^i+t]. But 

gives Con[v > a„i+£] < Qa^i+t / v}^'^ . Hence, 



^ ^ 6a„i+. ^ 6(1 + e)a^ 



n 



n 



l+e 



Proof of Lemma 55: Let p = (1 — e)a„. By Lemma 19 we know that -jfz:^ — '^n- Hence, for all i 



Pr[z;j <p]< Pr[fi < a^i-^] < 1 



n 



It follows that 



Prpi, Vi>p\>l - [I 



n 



> 1 



Hence, with probabihty at least 1 — the buyer will purchase an item and will pay p. Hence, 

the revenue is at least (1 — e^~^'''^){l — e)an > (1 — e*^~"^^ — e)a„. □ 



Proof of Theorem 56: Let e' = e/12. Depending on the value of n our algorithm proceeds in one of 



the following ways: 

• If n > (IJe'Y/''' , we do binary search starting at an anchoring point of the distribution (see 
Section C| to find some p G [1 — e', l+e']a„,. This takes time 0(log(^^^) and the size of the input, 
since a„ < 02 • log2 n. We then set every item's price to (1 — 2e')p. Since (1 — 2e')p < (1 — (')otn-, 

Pr[3 i, Vi>{l- 2e')p\ > Pr[3 i, Vi > {1 - e')an]. 

On the other hand, (1 — 2e')p > (1 — 2e')(l — €')an- Thus, the revenue we obtain if we price all 
items at (1 — 2e')p is at least (1 — 2e') times the revenue under price vector P = ((1 — e')an,(l — 
e')an, . . . , (1 - e')an). Hence, the revenue is at least (1 - 12e')OPT = (1 - e)OPT. 



If n < (1/e')^/'^', we simply use the algorithm for non-i.i.d. case (Corollary 52). 



□ 
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